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Foreword 


The phonetic detail of the pronunciation of British English has already 
been described in several excellent works, notably those of Daniel 
Jones. This present book, written after a number of years of teaching 
the spoken language both to English students and to foreign learners, 
Sets out to place the phonetics of British English in a larger framework 
than has been customary. For this reason, emphasis is given to the 
function of the spoken medium as a form of communication. Some 
treatment of the historical background and the linguistic implications 
of the present sound system is included, as well as information 
concerning the acoustic nature of English sounds. Those sections in 
Part II, in which detailed descriptions of the realizations of phonemes 
are given, deal with spelling forms, articulatory and acoustic features, 
variants and chief historical sources. In addition, throughout Parts Il 
and III, general advice to the foreign learner is included. 

The book is intended to serve as a general introduction to the subject 
which will encourage the reader to consult more specialized works on 
particular aspects. Though my own views and observations intrude 
both in the material and in its presentation, much of the information 
given is derived from the numerous sources quoted in the Bibliography. 
In particular, new evaluations, which seem to me to reflect more nearly 
the current trend of RP forms, are made of the phonetic characteristics 
of certain phonemes. In the acoustic field, where so much remains to be 
investigated and where research proceeds so rapidly, an attempt has 
been made to sum up the results of wark done in the post-war period, 
though many of the conclusions must as yet be regarded as tentative. 
It was tempting to apply to British English a logical, elegant, and 
economical phonemic analysis such as 1s now commonplace in the 
United States, involving a very much simplified phonemic notation. If 
this has not been done, it is mainly because a type of analysis was 
required which was explicit on the phonetic level as well as reasonably 
tidy on the phonemic level; it seemed easier, for instance, to deal with 
phonetic developments and variants in terms of the largely traditional 
(for British English) transcription which has been used. 

Throughout the book, the influence of my teachers, Professor Daniel 
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Jones and Dr. H. N. Coustenoble, will be obvious. To them my sincere 
thanks are due, not only for their teaching over the past twenty-five 
years but also for the example of dedication which they gave me. My 
gratitude is also due to Professor D. B. Fry and all my colleagues of the 
Department of Phonetics, University College, London, whose brains I 
have constantly picked during the writing of this book. In particular, 
I have valued the help of Mr. J. D. O’Connor and Dr. A. J. Fourcin, 
who have read sections of the book, made corrections, and suggested 
improvements. I am also much indebted to Professor Randolph Quirk 
for his helpful comments on several points of Old English phonology. I 
am most grateful, too, to Mr. J. C. Wells, who has generously allowed 
me to use unpublished figures resulting from his work on the formants 
of RP vowels. 


A. C. GIMSON, 
University College, 
London. 
December, 1961. 


Preface to Second Impression 


A number of mistakes have been corrected in this second impression. 
My grateful thanks go to all those who have pointed out errors, par- 
ticularly to Mrs. H. J. Uldall and Mr. J. C. Wells for their corrections 
and comments, and to Mr. G. Perren of the British Council, London, 
who has kindly allowed me to use on p. 219 his amended figures for the 
English consonant frequencies. 


ACE 
November, 1963. 


Preface to the Second Edition 


In this revised edition, a number of sections have been modified or 
expanded in the light of evidence which has appeared since the first 
publication of this book. The changing status of RP, especially 
amongst young people, has necessitated the re-writing of the pages 
dealing with this matter, and it has seemed to me worthwhile to add a 
section on the current problems of the intelligibility of spoken English 
in the world. The chief additions, however, are concerned with the 
phonotactic possibilities of English, the permissible variations in the 
phonemic components of English words and the frequency of occur- 
rence of monosyllables and polysyllables in continuous speech. Finally, 
the bibliography, while remaining selective, has been considerably 
extended. 


A.C. G. 
March, 1970. 
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Preface to the Third Edition 


Ten years after the appearance of the second edition, I have been able in 
this new version to introduce a number of small amendments. More sig- 
nificantly, I have added a new chapter which is concerned with the 
problems of teaching the pronunciation of English to foreign learners. 
This addition reflects the widespread use of the book as a text in the 
teaching of English as a foreign language. The style of the appended 
chapter inevitably differs from that of the rest of the book. The first 
eleven chapters remain essentially descriptive within a given theoretical 
framework. The new chapter is much more speculative, since it con- 
tains opinions and frequently offers advice to the reader. The opinions 
and advice are, however, the result of my experience gained in some 
thirty-five years of teaching foreign students and may therefore be 
found to be of some value to others. 


A. C.G. 
January, 1980. 


Preface to the Fourth Edition 


Despite the increasing number of books concerning phonetics and 
phonology, none has been published to supersede this volume which 
has been the most comprehensive and authoritative work on the pro- 
nunciation of English for almost thirty years. Clearly, it can remain so 
only if it is regularly updated; and in this fourth edition I have tried to 
be thorough whilst respecting the original spirit of the work, a task 
which I have undertaken as one who over seventeen years was succes- 
sively a student, colleague and collaborator of Professor Gimson. 

General updating is to be found in every chapter with regard to recent 
research and current terminology, the former being reflected in the 
revised bibliography. More particularly, I have expanded Section 3 (on 
the Word and Connected Speech) to include explicit accounts of some 
of the rules concerning such topics as suffix formation, stress shift, 
smoothing and elision. The book has always dealt with these implicitly 
and still provides ample exemplification, but it seemed that summaries 
of certain processes and patterning would enhance its usefulness as an 
important reference work. It also seemed appropriate that I should add 
some information in this section on stylistic variation in RP, as I was 
asked to carry out research in this field by Professor Gimson who then 
supervised my doctoral thesis. The generalizations that I feel able to 
make here are based on recorded data taken from twenty hours of spon- 
taneous conversation and have not previously been published in any 
detail. 

Similarly, the tentative comments that I have felt able to add to 
Chapter 6 on current changes in RP are based on pronunciations 
observed and elicited in connection with lexicographical work carried 
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out by Professor Gimson and myself. The replacement of the symbol 
/au/ by /au/ in this edition brings the transcription in line with the 
English Pronouncing Dictionary. 

It is to be hoped that any changes and additions that I have made will 
serve to provide such clear and up-to-date information as 1s needed by 
another generation of readers who hope to find comprehensive intro- 
ductory coverage of the pronunciation of English within a single 
volume. 


Susan Ramsaran 
University College, 
London 
August, 1988. 


The publishers wish to thank Dr Alan Cruttenden for his invaluable 
assistance in the production of this new edition. 


List of Phonetic Symbols and Signs 


Cx CAE VO 0 DOS O 8 


NY O 


Hom OH mu 


Cardinal Vowel no. 4 (approximately as in French patte); used for 
first element of Eng. diphthong [al] 

front vowel between open and half-open (Eng. vowel in cat) 

Cardinal Vowel no. 5 (approximately as in French pas); used for 
Eng. [a:] in car 

open rounded Cardinal Vowel no. 5 (Eng. vowel in dog) 

voiced bilabial plosive (Eng. b in labour) 

voiced ingressive bilabial plosive 

voiced bilabial fricative 

voiceless palatal plosive 

voiceless palatal fricative 

Cardinal Vowel no. 6 (approximately as in German Sonne); used 

for Eng. [9:] in saw, and first element of diphthong [91] 

voiced alveolar plosive (Eng. d in lady) 

voiced ingressive alveolar plosive 

voiced dental fricative (Eng. th in other) 

Cardinal Vowel no. 2 (approximately as in French thé); used for 
Eng. [e] in bed, and first element of diphthong [e1] 

unrounded central vowel (Eng. initial and final vowels in another) 

retroflexed central vowel (American er in water) 

Cardinal Vowel no. 3 (approximately as in French pere); used for 
first element of diphthong [eo] 

unrounded central vowel (Eng. vowel in bird) 

voiceless labio-dental fricative (Eng. fin four) 

voiced palatal plosive 

voiced velar plosive (Eng. g in eager) 

voiced ingressive velar plosive 

voiceless glottal fricative (Eng. A in house) 

voiced glottal fricative (Sometimes Eng. A in behind) 

Cardinal Vowel no. 1 (approximately as in French sí); used for 
Eng. [i:] in see 

unrounded central close vowel 

centralized unrounded half-close vowel (Eng. vowel in sit) 

palatal unrounded semi-vowel (Eng. y in you) 
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linguo-alveolar tap (sometimes r in Eng. very) 

voiceless velar plosive (Eng. c in car) 

voiced alveolar lateral continuant (Eng. / in /ay) 

voiced alveolar lateral continuant with velarization (Eng. // in ill) 

voiceless alveolar lateral fricative (Welsh //) 

voiced bilabial nasal (Eng. m in me) 

voiced labio-dental nasal (Eng. m in comfort) 

unrounded Cardinal Vowel no. 8 

voiced alveolar nasal (Eng. n in no) 

voiced velar nasal (Eng. ng in sing) 

voiced palatal nasal (French gn in vigne) 

Cardinal Vowel no. 7 (approximately as in French eau) 

rounded Cardinal Vowel no. 2 (approximately as in French peu) 

open rounded Cardinal Vowel no. 3 (approximately as in French 
peur) 

voiceless dental fricative (Eng. th in thing) 

voiceless bilabial plosive (Eng. p in pea) 

linguo-alveolar trill (Scottish, Italian r); also used for Eng. rin red 

voiced post-alveolar approximant (Eng. r in red) 

voiced retroflex approximant 

voiced uvular trill 

voiced uvular fricative or approximant 

voiceless alveolar fricative (Eng. s in see) 

voiceless palato-alveolar fricative (Eng. sh in she) 

voiceless alveolar plosive (Eng. ¢ in tea) 

voiceless alveolar click 

Cardinal Vowel no. 8 (approximately as in French doux); used for 
Eng [u: ] in do 

central rounded close vowel 

centralized rounded half-close vowel (Eng. u in put) 

voiced labio-dental fricative (Eng. v in ever) 

unrounded Cardinal Vowel no. 6; used for Eng. vowel in cup 

labio-dental approximant 

labial-velar semi-vowel (Eng. w in we) 

voiceless labial-velar fricative (Sometimes Eng. wh in why) 

voiceless velar fricative (Scottish ch in loch) 

rounded Cardinal Vowel no. 1 (approximately as in French du) 

voiced palatal lateral continuant (Italian gl in egli) 

unrounded Cardinal Vowel no. 7 

voiced velar fricative 

voiced alveolar fricative (Eng. z in /azy) 

voiced palato-alveolar fricative (Eng. s in measure) 

voiceless bilabial fricative 

voiceless alveolar lateral click 

glottal plosive (stop) 
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indicates full length of preceding vowel 

indicates half length of preceding vowel 

main accentual stress or pitch prominence on following syllable 

secondary accentual stress on following syllable 

high unaccented pre-nuclear syllable 

high falling pitch 

low falling pitch 

high rising pitch 

low rising pitch 

falling-rising pitch 

rising-falling pitch 

syllable carrying secondary accent 

syllable, immediately following nucleus, carrying secondary 
accent based on quality/quantity 

syllable carrying primary (nuclear) accent 

unaccented syllable 

nasalization, e.g. [0] 

centralization, e.g. [6] 

more open quality, e.g. [q] 

closer quality, e.g. [9] 

devoiced lenis consonant, e.g. [z] (above in the case of [ġ, 3, 9]) 

syllabic consonant, e.g. [n] (above in the case of [n)) 

dental articulation, e.g. [t] 


í ] phonetic transcription 
/ / phonemic transcription 


Other Symbols 


> changed to 
< developed from 


TT 


RP (figs. 9-27) 
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Part I 
Speech and Language 





1 


Communication 


1.1 Speech 


One of the chief characteristics of the human being is his ability to com- 
municate to his fellows complicated messages concerning every aspect 
of his activity. A man possessing the normal human faculties achieves 
this exchange of information mainly by means of two types of sensory 
stimulation, auditory and visual. The child will learn from a very early 
age to respond to the sounds and tunes which his elders habitually use in 
talking to him; and, in due course, from a need to communicate, he will 
himself begin to imitate the recurrent sound patterns with which he has 
become familiar. In other words, he begins to make use of speech; and 
his constant exposure to the spoken form of his own language, together 
with his need to convey increasingly subtle types of information, leads 
to a rapid acquisition of the framework of his spoken language. Never- 
theless, with all the conditions in his favour, a number of years will pass 
before he has mastered not only the sound system used in his com- 
munity but also has at his disposal a vocabulary of any extent or is 
entirely familiar with the syntactical arrangements in force in his 
language system. It is no wonder, therefore, that the learning of 
another language later in life, acquired artificially in brief and sporadic 
spells of activity and without the stimulus arising from an immediate 
need for communication, will tend to be tedious and rarely more than 
partially successful. In addition, the more firmly consolidated the basis 
of a first language becomes or, in other words, the later in life that a 
second language is begun, the more the learner will be subject to 
resistances and prejudices deriving from the framework of his original 
language. It may be said that, as we grow older, the acquisition of anew 
language will normally entail a great deal of conscious, analytical 
effort, instead of the child’s ready and facile imitation. 


1.2 Writing 


Later in life the child will be taught the conventional visual representa- 
tion of speech—he will learn to use writing. To-day, in considering 
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those languages which have long possessed a written form, weare apt to 
forget that the written form is originally an attempt at reflecting the 
spoken language and that the latter precedes the former for both the 
individual and the community. Indeed, in many languages, so parallel 
are the two forms felt to be that the written form may be responsible for 
changes in pronunciation or may at least tend to impose restraints upon 
its development. In the case of English, this sense of parallelism, rather 
than of derivation, may be encouraged by the obvious lack of consistent 
relationship between sound and spelling. A written form of English, 
based on the Latin alphabet, has existed for more than 1,000 years and, 
though the pronunciation of English has been constantly changing 
during this time, few basic changes of spelling have been made since the 
fifteenth century. The result is that written English is often an inade- 
quate and misleading representation of the spoken language of to-day. 
Clearly it would be unwise, to say the least, to base our judgments con- 
cerning the spoken language on prejudices derived from the ortho- 
graphy. Moreover, if we are to examine the essence of the English 
language, we must make our approach through the spoken rather than 
the written form. Our primary concern will be the production, trans- 
mission, and reception of the sounds of English—in other words, the 
Phonetics of English. 


1.3 Language 


From the moment that we abandon orthography as our starting point, 
itis clear that the analysis of the spoken form of English is by no means 
simple. Each of us uses an infinite number of different speech sounds 
when we speak English. Indeed, it is true to say that it is difficult to pro- 
duce two sounds which are precisely identical from the point of view of 
instrumental measurement: two utterances by the same person of the 
word cat may well show quite marked differences when measured 
instrumentally. Yet we are likely to say that the same sound sequence 
has been repeated. In fact we may hear clear and considerable dif- 
ferences of quality in the vowel of cat as, for instance, in the London 
and Manchester pronunciations of the word; yet, though we recognize 
differences of vowel quality, we are likely to feel that we are dealing 
with a ‘variant’ of the ‘same’ vowel. It seems, then, that we are con- 
cerned with two kinds of reality: the concrete, measurable reality of 
the sounds uttered, and another kind of reality, an abstraction made in 
our minds, which appears to reduce this infinite number of different 
sounds to a ‘manageable’ number of categories. In the first, concrete, 
approach, we are dealing with sounds in relation to speech; at the 
second, abstract, level, our concern is the behaviour of sounds in a 
particular /anguage. A language is a system of conventional signals 
used for communication by a whole community. This pattern of con- 
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ventions covers a system of significant sound units (the phonemes), the 
inflexion and arrangement of ‘words’, and the association of meaning 
with words. An utterance, an act of speech, is a single concrete mani- 
festation of the system at work. As we have seen, several utterances 
which are plainly different on the concrete, phonetic level may fulfil the 
same function, 1.e. are the ‘same’, on the systematic language level. It is 
important in any analysis of spoken language to keep this distinction in 
mind and we shall later be considering in some detail how this dual 
approach to the utterance is to be made. It is not, however, always 
possible or desirable to keep the two levels of analysis entirely separate: 
thus, as we shall see, we will draw upon our knowledge of the linguis- 
tically significant units to help us in determining how the speech con- 
tinuum shall be divided up on the concrete, phonetic level; and again, 
our classification of linguistic units will be helped by our knowledge of 
their phonetic features. 


1.4 Redundancy 


Finally, it is well to remember that, although the sound system of our 
spoken languages serves us primarily as a medium of communication, 
its efficiency as such an instrument of communication does not depend 
upon the perfect production and reception of every single element of 
speech. A speaker will, in almost any utterance, provide the listener 
with far more cues than he needs for easy comprehension. In the first 
place, the situation, or context, will itself delimit very largely the pur- 
port of an utterance. Thus, in any discussion about a zoo, involving a 
statement such as ‘We saw the lions and tigers’, we are predisposed by 
the context to understand /ions, even if the n is omitted and the word 
actually said is /iars. Or again, we are conditioned by grammatical 
probabilities, so that a particular sound may lose much of its signi- 
ficance, e.g. in the phrase ‘These men are working’, the quality of the 
vowel in men is not as vitally important for deciding whether it is a 
question of men or man as it would be if the word were said in isolation, 
since here the plurality is determined in addition by the demonstrative 
adjective preceding men and the verb form following. Then again, there 
are particular probabilities in every language as to the different com- 
binations of sounds which will occur. Thus in English, if we hear an 
initial th sound [6], we expect a vowel to follow, and of the vowels some 
are much more likely than the others. We distinguish such sequences as 
-gl and -dl in final positions, e.g. in beagle and beadle; but this distinc- 
tion is not relevant initially, so that evenif d/oves is said, we understand 
gloves. Or again, the total rhythmic shape of a word may provide an 
important cue to its recognition: thus, in a word such as become, the 
general rhythmic pattern may be said to contribute as much to the 
recognition of the word as the precise quality of the vowel in the first, 
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weakly accented syllable. Indeed, we may come to doubt the relative 
importance of vowels as a help to intelligibility, since we can replace our 
twenty English vowels by the single vowel [ə] in any utterance and still, 
if the rhythmic pattern is kept, retain a high degree of intelligibility. An 
utterance, therefore, will provide a large complex of cues for the 
listener to interpret, but a greal deal of this information will be unneces- 
sary, or redundant, as far as the listener’s needs are concerned. On the 
other hand, such an over-proliferation of cues will serve to offset any 
disturbance such as noise or to counteract the sound quality divergences 
which may exist between speakers of two dialects of the same language. 
But to insist, for instance, upon exaggerated articulation in order to 
achieve clarity may well be to go beyond the requirements of speech asa 
means of communication; indeed, certain obscurations of quality are, 
and have been for many centuries, characteristic of English. Aesthetic 
judgments on speech, such as those which deplore the use of the ‘intru- 
sive’ r, take into account social considerations of asomewhat different 
order from those involved in a study of speech as communication. 


1.5 Phonetics and Linguistics 


This book is primarily concerned with the sound system of English and 
it is proper, as we have seen, that phonetic and phonemic analysis 
should occupy an important place in the study of any language. Indeed, 
when it 1s a question of a language which is being subjected for the first 
time to scientific analysis, it is necessary that some statement of the 
sound system should be made at the very outset, so that a notation can 
be devised for the recording of the language in a written form. Never- 
theless, it should be remembered that phonetic analysis constitutes but 
one step in a linguistic investigation. 

A complete description of the current state of a language must 
provide information on several interrelated levels: — 

(1) Phonology.—The concrete phonetic characteristics (arti- 
culatory, auditory, acoustic) of the sounds used in the language; the 
concrete phonetic level is often separated from the more abstract 
Phonological level which analyses the patterning of sounds in language 
and includes the functional, phonemic behaviour of these sounds for 
distinctive purposes; the combinatory possibilities (syllabic structure or 
Phonotactics) of the phonemes; the nature and use of such prosodic 
features as pitch, stress and length. A study of the phonic substance of 
the language may be accompanied by a description of the written form 
of the language (graphology). 

(2) Lexis.—The total number of word forms which exist. 

(3) Morphology.—The structure of words. 

(4) Syntax.—The system of rules governing the structure of phrases, 
clauses and sentences consisting of words contained in the lexicon. 
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Note. Statements may be made which combine phonemic and 
morphemic features (morphophonemics or morphophonology), e.g. 
the phonemes /s/ and /z/ in cats and dogs are exponents of the same 
morpheme of plurality which might be symbolized as |s]. 

(5) Semantics.—The relation of meaning to the signs and symbols of 
language. 

Other aspects of language which would require investigation include 
the variation of the same language in different regions and social classes 
(dialectology), the influence of context and style upon the form and 
substance of the language; the influence of situation on the interpreta- 
tion of utterances (pragmatics); the behaviour of human beings in their 
production and perception of the language (psycholinguistics); the 
interaction of the language and the society in which it is spoken (socio- 
linguistics). 

Finally, itis clear that the phonology, lexis, syntax and semantics of a 
language are always undergoing change in time. The state of a language 
at any (synchronic) moment must be seen against a background of its 
historical (diachronic) evolution. It is for this reason that this book 
includes information on earlier states of the sound system of English, 
with some speculation on possible developments in the future. 


2 


The Production of Speech 
The Physiological Aspect 


2.1 The Speech Chain 


Any manifestation of language by means of speech is the result of a 
highly complicated series of events. The communication in sound of 
such asimple concept as ‘It’s raining’ involves a number of activities on 
the part of the speaker. In the first place, the formulation of the concept 
will take place at a linguistic level, 1.e. in the brain; the first stage may, 
therefore, be said to be psychological. The nervous system transmits 
this message to the so-called ‘organs of speech’ and these in turn behave 
in aconventional manner, which, as we have learned by experience, will 
have the effect of producing a particular pattern of sound; the second 
important stage for our purposes may thus be said to be articulatory or 
Physiological. The movement of our organs of speech will create 
disturbances in the air, or whatever the medium may be through which 
we are talking; these varying air pressures may be investigated and they 
constitute the third stage in our chain, the physical or acoustic. Since 
communication generally requires a listener as well as a speaker, these 
Stages will be reversed at the listening end: the reception of the sound 
waves by the hearing apparatus (physiological) and the transmission 
of the information along the nervous system to the brain, where the 
linguistic interpretation of the message takes place (psychologi- 
cal). Phonetic analysis has often ignored the role of the listener. But 
any investigation of speech as communication must ultimately be 
concerned with both the production and the reception ends. A number 
of phonetic features, e.g. stress, must be defined in different terms 
according to whether the emphasis is laid on the speaker’s feeling for his 
own speech activity or on the listener’s appreciation of significant 
features. 

Our immediate concern, however, is with the speaker’s behaviour 
and more especially, on the concrete speech level, with the activity 
involved in the production of sounds. For this reason, we must now 
examine the speaker’s articulatory stage (his speech mechanism) to dis- 
cover how the various organs behave in order to produce the sounds of 
speech. 
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2.2 The Speech Mechanism 


Man possesses, in common with many other animals, the ability to 
produce sounds by using certain of his body’s mechanisms. The human 
being differs from other animals in that he has been able to organize the 
range of sounds which he can emit into a highly efficient system of com- 
munication. Non-human animals rarely progress beyond the stage of 
using the sounds they produce as a reflex of certain basic stimuli to 
signal fear, hunger, sexual excitement, and the like. Nevertheless, like 
other animals, man when he speaks makes use of organs whose primary 
physiological function is unconnected with vocal communication, 
namely, those situated in the respiratory tract. 

2.2.1 Thus, the most usual source of energy for our vocal activity is 
provided by an air-stream expelled from the /ungs. There are languages 
which possess sounds not requiring lung (pulmonic) air for their arti- 
culation, and, indeed, in English we have one or two extra-linguistic 
sounds, such as that we write as Tut-tut and the noise of encouragement 
made to horses, which are produced without the aid of the lungs; but all 
the essential sounds of English need lung air for their production. Our 
utterances are, therefore, largely shaped by the physiological limita- 
tions imposed by the capacity of our lungs and by the muscles which 
control their action. We are obliged to pause in articulation in order to 
refill our lungs with air and the number of energetic peaks of exhalation 
which we make will to some extent condition the length of any breath 
group. Syllabic pulses and dynamic stress, both typical of English, are 
directly related to the behaviour of the muscles which activate the lungs. 
In those cases where the lung air-stream is not available for the upper 
organs of speech, as when, after the removal of the larynx, lung air does 
not reach the mouth but escapes from an artificial aperture in the neck, 
a new source of energy, such as stomach air, has to be employed; anew 
source of this kind imposes restrictions of quite a different nature from 
those exerted by the lungs, so that the organization of the utterance into 
groups is changed and variation of energy is less efficiently controlled. 

A number of techniques are available for the investigation of the 
activity in speech of the lungs and their controlling muscles. It is, for 
instance, possible to measure air pressures within the lungs by 
observing the reaction of an air-filled balloon placed in the stomach; 
X-ray photography reveals the gross movements of the ribs and 
muscles; and the technique of electromyography has demonstrated! the 
electrical activity of those respiratory muscles most concerned in 
speech, notably the internal intercostals. 

2.2.2 The air-stream provided by the lungs undergoes important 


l! P, Ladefoged, M. H. Draper, and D. Whitteridge, ‘Syllables and Stress,’ Miscellanea 
Phonetica III, 1958 (International Phonetic Association). 
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modifications in the upper stages of the respiratory tract before it 
acquires the quality of a speech sound. First of all, in the trachea or 
windpipe, it passes through the /arynx, containing the so-called vocal 


folds (see Fig. 1). 


e. Hard 
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Vocal Folds 


FIG. 1.—Organs of speech (schematic diagram). 


The larynx is a casing, formed of cartilage and muscle, situated in the 
upper part of the trachea. lts forward portion is prominent in the neck 
below the chin and is commonly called the ‘Adam’s apple’. Housed 
within this structure from back to front are the vocal folds, two folds of 
ligament and elastic tissue which may be brought together or parted by 
the rotation of the arytenoid cartilages (attached at the posterior end of 
the folds) through muscular action. The inner edge of these folds has a 
length of about 23 mm. in adult men and about 18 mm. in women. The 
opening between the folds is known as the glottis. Biologically, the 
vocal folds act as a valve which is able to prevent the entry into the 
trachea and lungs of any foreign body or which may have the effect of 
enclosing the air within the lungs to assist in muscular effort on the part 
of the arms or the abdomen. In using the vocal folds for speech, the 
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human being has adapted and elaborated upon this original open-or- 
shut function in the following ways (see Fig. 2):— 


anterior 





(c) 


FIG. 2.—Diagrams of the vocal folds as seen from above: (a) tightly closed as 
for [?]; (b) wide open as for breath; (c) loosely together and vibrating as for 
voice. 


a. The glottis may be held tightly closed, with the lung air pent up 
below it. This ‘glottal stop? [?] frequently occurs in English, e.g. when it 
precedes the energetic articulation of a vowel or when it reinforces or 
even replaces p, t, k. It may also be heard in defective speech, such as 
that arising from cleft palate, when [?] may be substituted for the stop 
consonants, which, because of the nasal air escape, cannot be articu- 
lated with proper compression in the mouth cavity. 

b. The glottis may be held open as for normal breathing. 

c. The action of the vocal folds which is most characteristically a 
function of speech consists in their role as a vibrator set in motion by 
lung air—the production of voice, or phonation; this vocal fold vibra- 
tion is a normal feature of all vowels or of such a consonant as z com- 
pared with voiceless s. In order to achieve the effect of voice, the vocal 
folds are brought sufficiently close together that they vibrate when sub- 
jected to air pressure from the lungs. This vibration, of a somewhat 
undulatory character, is caused! by compressed air forcing an opening 
of the glottis and the resultant reduced air pressure permitting the 
elastic folds to come together once more; the vibratory effect may easily 
be felt by touching the neck in the region of the larynx when saying ah or 
z, for instance, on a low note. In the typical speaking voice of a man, 


1The theory of vocal fold vibration here described is known as the aerodynamic- 
myoelastic or tonic theory. Since the war, however, the experimental work of R. Husson 
and his associates has introduced a new explanation of the vibrations, in neuro-muscular 
(or clonic) terms. According to this theory, the vibrations of the vocal folds in speech are 
activated directly by the laryngeal recurrent nerve independently of sub-glottal air- 
pressure (although this latter will always be present if ‘voice’ is to be produced). The 
frequency of vibration (pitch) is also explained as dependent on direct innervation (rather 
than as the result largely of variations of air-pressure) and only the intensity of the larynx 
note as dependent on the force of sub-glottal air. 
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this opening and closing action is likely to be repeated between 100 and 
150 times in a second, i.e. there are that number of vibration cycles per 
second (cps)'; in the case of a woman’s voice, this frequency of vibra- 
tion might well be between 200 and 325 cps. We are able, within limits, 
to vary the speed of vibration of our vocal folds or, in other words, are 
able consciously to change the pitch of the voice produced in the larynx; 
the more rapid the rate of vibration, the higher is the pitch (an extremely 
low rate of vibration being partly responsible for the ‘grunting, creaky’ 
type of voice). Normally the vocal folds come together rapidly and part 
more slowly, the opening phase of each cycle thus being longer than the 
closing phase. This gives rise to ‘modal’ (or ‘normal’) voice which 1s 
used for most of English speech. Other modes of vibration result in 
other voice qualities, most notably breathy and creaky voice which are 
used contrastively ina number of languages. (See also Chapter 5, §5.8.) 
Moreover, we are able, by means of variations in pressure from the 
lungs, to modify the size of the puff of air which escapes at each vibra- 
tion of the vocal folds; in other words, we can alter the amplitude of the 
vibration, with a corresponding change of loudness of the sound heard 
by a listener. The normal human being soon learns to manipulate his 
glottal mechanism so that most delicate changes of pitch and loudness 
are achieved. Control of this mechanism is, however, very largely exer- 
cised by the ear, so that such variations are exceedingly difficult to teach 
to those who are born deaf, and a derangement of pitch and loudness 
control is hable to occur among those who become totally deaf later in 
ie 

d. One other action of the larynx should be mentioned. A very quiet 
whisper may result merely from holding the glottis in the ‘voiceless’ 
position. But the more normal whisper, by means of which we are able 
to communicate with some ease, can be felt to involve energetic 
articulation and considerable stricture in the glottal region. Such a 
whisper may in fact be uttered with an almost total closure of the glottis 
and an escape of air in the region of the arytenoids. Once the vibrations 
of the vocal folds are lost, however, variations of pitch can be only of 
the grossest kind. 

For our purposes in the analysis of English, the most important of 
these four activities are those which result in the production of voiced or 
voiceless sounds and that which produces the glottal stop. 

The behaviour of the vocal folds may be observed in several ways. In 
particular, it is possible by means of the laryngoscope to have a 
Stationary mirrored image of the glottis. Using stroboscopic tech- 
niques, it is possible to obtain a moving record and high-speed films 
have been made of the vocal folds, showing their action in ordinary 


¡The term hertz (Hz) is now generally applied to such a unit of frequency; ‘cps’, a more 
explicit abbreviation, is retained in this book. 
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breathing, producing voice and whisper, and closed as for a glottal 
stop. The situation of the folds renders them difficult to photograph 
with X-rays other than by tomographic techniques. 

2.2.3 The Resonating Cavities The air-stream, having passed 
through the larynx, is now subject to further modification according to 
the shape assumed by the upper cavities of the pharynx and mouth and 
according to whether the nasal cavity is brought into use or not. These 
cavities function as the principal resonators of the note produced in the 
larynx. 

2.2.3.1 The pharyngeal cavity (see Fig. 1) extends from the top of 
the larynx and oesophagus, past the epiglottis and the root of the 
tongue, to the region in the rear of the soft palate. It is convenient to 
identify these sections of the pharynx by naming them: laryngo- 
pharynx, oropharynx, nasopharynx. The shape and volume of this long 
chamber may be considerably modified by the constrictive action of the 
muscles enclosing the pharynx, by the movement of the back of the 
tongue, by the position of the soft palate which may, when raised, 
exclude the nasopharynx, and by the raising of the larynx itself. The 
position of the tongue in the mouth, whether it is advanced or retracted, 
will affect the size of the oropharyngeal cavity; the modifications in 
shape of this cavity should, therefore, be included in the description of 
any vowel. It is a characteristic of some kinds of English pronunciation 
that certain vowels are articulated with a strong pharyngeal contrac- 
tion; in addition, a constriction may be made between the lower rear 
part of the tongue and the wall of the pharynx so that friction, with or 
without voice, is produced, such fricative sounds being a feature of a 
number of languages. 

The pharynx may be observed by means of a laryngoscope or 
nasendoscopy and its constrictive actions are revealed by lateral X-ray 
photography. 

The escape of air from the pharynx may be effected in one of three 
ways: — 

a. The soft palate may be lowered, as in normal breathing, in which 
case the air may escape through the, nose and the mouth. This is the 
position taken up by the soft palate in articulation of the French 
nasalized vowels in such a phrase as un bon vin blanc [€ bõ vé bla], 
the particular quality of such vowels being achieved through the 
function of the nasopharyngeal cavity. 

b. The soft palate may be lowered so that a nasal outlet is afforded to 
the air stream, but a complete obstruction is made at some point in the 
mouth, with the result that, although air enters all or part of the mouth 
cavity, no oral escape is possible. A purely nasal escape of this sort 
occurs in such nasal consonants as [m, n, y] in the English words ram, 
ran, rang. In a snore and some kinds of defective speech, this nasal 
escape may be accompanied by friction between the rear side of the soft 
palate and the pharyngeal wall. 
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c. The soft palate may be held in its raised position, eliminating the 
action of the nasopharynx, so that the air escape is solely through the 
mouth. All normal English sounds, with the exception of the nasal con- 
sonants mentioned, have this oral escape. Moreover, if for any reason 
the lowering of the soft palate cannot be effected, or if there is an 
enlargement of the organs enclosing the nasopharynx or a blockage 
brought about by mucus, it is often difficult to articulate either 
nasalized vowels or nasal consonants. In such speech, typical of 
adenoidal enlargement or obstruction caused by a cold, the French 
phrase mentioned above would have ifs nasalized vowels turned into 
their oral equivalents and the English word morning would have its 
nasal consonants replaced by [b, d, g]. On the other hand, an inability 
to make an effective closure by means of the raising of the soft palate 
—either because the soft palate itself is defective or because an abnor- 
mal opening in the roof of the mouth gives access to the nasal cavity— 
will result in the general nasalization of vowels and the failure to articu- 
late such aral stop consonants as [b, d, g]. This excessive nasalization 
(or hypernasality) is typical of such a condition as cleft palate. 

It is evident that the action of the soft palate is accessible to observa- 
tion by direct means, as well as by lateral X-ray photography; the 
pressure of the air passing through the nasal cavities may be measured 
at the nostrils or within the cavities themselves. 

2.2.3.2 The Mouth Although all the cavities so far mentioned play 
an essential part in the production of speech sounds, most attention has 
traditionally been paid to the behaviour of the cavity formed by the 
mouth. Indeed, in many languages the word tongue is used to refer to 
our speech and language activity. Such a preoccupation with the oral 
cavity is doubtless due to the fact that it is the most readily accessible 
and easily observed section of the vocal tract; but there is in such an 
attitude a danger of gross over-simplification. Nevertheless, it is true 
that the shape of the mouth determines finally the quality of the 
majority of our speech sounds. Far more finely controlled variations of 
shape are possible in the mouth than in any other part of the speech 
mechanism. 

The only boundaries of this oral chamber which may be regarded as 
relatively fixed are, in the front, the teeth; in the upper part, the hard 
palate; and, in the rear, the pharyngeal wall. The remaining organs are 
movable: the lips, the various parts of the tongue, and the soft palate 
with its pendent uvula (see Fig. 1). The lower jaw, too, is capable of 
very considerable movement; its movement will control the gap 
between the upper and lower teeth and also to a large extent the disposi- 
tion of the lips. The space between the upper and lower teeth will often 
enter into our description of the articulation of sounds; in all such 
cases, it is Clear that the movement of the lower jaw is ultimately 
responsible for the variation described. 
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It is convenient for our descriptive purposes to divide the roof of the 
mouth into three parts: moving backwards from the upper teeth, first, 
the alveolar or teeth ridge which can be clearly felt behinu the teeth; 
secondly, the bony arch which forms the hard palate and which varies in 
size and arching from one individual to another; and finally, the soft 
palate or velum which, as we have seen, is capable of being raised or 
lowered, and at the extremity of which is the uvula. All these parts can 
be readily observed by means of a mirror. The main divisions will be 
referred to as: dental, alveolar, palatal (the hard palate), and velar (the 
soft palate). 

(a) Of the movable parts, the lips, whenever the nasal passage is shut 
off, constitute the final orifice of the mouth cavity. The shape which 
they assume will, therefore, affect very considerably the shape of the 
total cavity. They may be shut or held apart in various ways. When they 
are held tightly shut, they form a complete obstruction or occlusion to 
the air-stream, which may either be momentarily prevented from escap- 
ing at all, as in the initial sounds of pat and bat, or may be directed 
through the nose by the lowering of the soft palate, as in the initial 
sound of mat. If the lips are held apart, the positions they assume may 
be summarized under six headings:— 

1. held sufficiently close together over all their length that friction 
occurs between them. Fricative sounds of this sort, with or without 
voice, occur in many languages and the voiced variety [B] is sometimes 
wrongly used by foreign speakers of English for the first sound in word; 

2. held sufficiently far apart for no friction to be heard, yet remain- 
ing fairly close together and energetically spread. This shape is taken up 
for a tense variety of the vowel in see and is known as the spread lip 
position; 

3. held in a relaxed position with a medium lowering of the lower 
jaw. This is the position taken up for the vowel of gef and 1s known as 
the neutral position; 

4. held relatively wide apart, without any marked rounding, as for 
the vowel in card. This 1s the open position; 

5. tightly pursed, so that the aperture is small and rounded, as in the 
vowel of do, or more markedly so in the French vowel of doux. This is 
the close rounded position; 

6. held wide apart, but with slight projection and rounding, asin the 
vowel of got. This is the open rounded position. 

Variations of these six positions may be encountered, e.g. in the 
vowel of saw, for which a type of lip rounding between open and close is 
commonly used. It will be seen from the examples given that lip position 
is particularly significant in the formation of vowel quality. English 
consonants, on the other hand, with the exception of such sounds as 
[p, b, m, w] whose primary articulation involves lip action, will tend to 
share the lip position of the adjacent vowel. In addition, the lower lip is 
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an active articulator in the pronunciation of [f, v], a light contact being 
made between the lower lip and the upper teeth. 

(b) Of all the movable organs within the mouth, the tongue is by far 
the most flexible and is capable of assuming a great variety of positions 
in the articulation of both vowels and consonants. The tongue 1s acom- 
plex muscular structure which does not show obvious sections; yet, 
since its position must often be described in considerable detail, certain 
arbitrary divisions are made. When the tongue is at rest, with its tip 
lying behind the lower teeth, that part which lies opposite the hard 
palate is called the front and that which faces the soft palate is called the 
back, with the region where the front and back meet known as the 
centre. This whole upper area of the tongue 1s sometimes referred to as 
the dorsum. The tapering section facing the teeth ridge is called the 
blade and its extremity the tip. The tip and blade region is sometimes 
known as the apex. The edges of the tongue are known as the rims. 

Generally, in the articulation of vowels, the tongue-tip remains low 
behind the lower teeth. The body of the tongue may, however, be 
‘bunched up’ in different ways, e.g. the front may be the highest part as 
when we say the vowel of he; or the back may be most prominent as in 
the case of the vowel in who; or the whole surface may be relatively low 
and flat as in the case of the vowel in ah. Such changes of shape can 
easily be felt if the above words are said in succession. These changes, 
moreover, together with the variations in lip position, have the effect of 
modifying very considerably the size of the mouth cavity and of 
dividing this chamber into two parts: that cavity which is in the forward 
part of the mouth behind the lips and that which is in the rear in the 
region of the pharynx. 

The various parts of the tongue may also come into contact with the 
roof of the mouth. Thus, the tip, blade, and rims may articulate with 
the teeth as for the th sounds in English, or with the upper alveolar ridge 
as in the case of t, d, s, z, n; or the apical contact may be only partial as 
in the case of / (where the tip/blade makes firm contact whilst the rims 
make none) or intermittent as in a rolled r. In some languages, notably 
those of India, the tip contact may be retracted to the very back of the 
teeth ridge or even slightly behind it; the same kind of retroflexion, 
without the tip contact, is typical of some kinds of English r, e.g. those 
used in South-West England and America. 

The front of the tongue may articulate against or near to the hard 
palate. Such a raising of the front of the tongue towards the palate 
(palatalization) is an essential part of the [f, 3] sounds in English words 
such as she and measure, being additional to an articulation made 
between the blade and the alveolar ridge; or again, it is the main feature 
of the [j] sound initially in yield. 

The back of the tongue can form a total obstruction by its contact 
with the soft palate, raised in the case of [k, g] and lowered for [n] as in 
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sing; or again, there may merely be a narrowing between the soft palate 
and the back of thetongue, so that friction of the type occurring finally 
in the Scottish loch is heard. And finally, the uvula may vibrate against 
the back of the tongue, or there may be a narrowing in this region which 
causes uvular friction. 

It will be seen from these few examples that, whereas for vowels the 
tongue 1s generally held in a position which is convex in relation to the 
roof of the mouth, some consonant articulations, such as the Southern 
English rin red and the / in table, will involve the ‘hollowing’ of the 
body of the tongue so that it has, at least partially, a concave relation- 
ship with the roof of the mouth. 

Moreover, the surface of the tongue, viewed from the front, may 
take on various forms: there may be a narrow groove running from 
back to front down the centre line as for s; or the grooving may be very 
much more diffuse as in the case of a sh [f] sound; or again, the whole 
tongue may be laterally contracted, with or without a depression in the 
centre (sulcalization), as is the case with various kinds of r sounds. 

(c) The oral speech mechanism is readily accessible to direct observa- 
tion as far as the lip movements are concerned and also many of the 
tongue movements which take place in the forward part of the mouth. 
A lateral view of the shape of the tongue over all its length and its 
relationship with the palate and the velum may be obtained by means of 
still and moving X-ray photography. Formerly, it was thought to be 
necessary for a thin metal chain to be laid along the surface of the 
tongue in order to make its contours visible under X-rays; but, to-day, a 
coating of barium solution would be sufficient to make its lateral 
outline visible. Such a procedure permits photography of tongue 
positions not only of vowels but also of many consonant articulations. 
It is not, however, to be expected that an X-ray photograph of the 
articulation of, say, the vowel in cat will show an identical tongue 
position for the pronunciation of a number of individuals. Not only is 
the sound itself likely to be different from one individual to another, 
but, even if the sound is for all practical purposes the ‘same’, the tongue 
positions may be different, since the boundaries of the mouth cavity are 
not identical for two speakers; and, in any case, two sounds judged to 
be the same may be produced by the same individual with different arti- 
culations. When, therefore, we describe an articulation in detail, it 
should be understood that such an articulation is typical for the sound 
in question, but that variations are to be expected. 

It is often important to know the extent of the area of contact 
between the tongue and the roof of the mouth. Formerly, such informa- 
tion was obtained by means of a false palate, of metal or plastic, made 
to fit exactly over the human palate and covered with a powder which 
would be removed by any contact from the tongue. This false palate 
could be removed after articulation and the contact pattern observed 
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and photographed. Such a procedure was superseded by a technique of 
direct photography of the palate, which was itself covered with a 
powdery substance to show tongue contact. This direct photography 
was not only more rapid than the former method but also had the 
advantage that the articulation was not deformed to the extent it may 
well be after the insertion of even the best fitting false palate. More 
recently the development of the electropalatograph has allowed a Series 
of representations to be made of the changing contacts between the 
tongue and the palate during speech. Electrodes on a false palate 
respond to any tongue contact, the contact points being simultaneously 
registered on a visual display. 


2.3 Articulatory Description 


We have now reviewed briefly the complex modifications which are 
made to the original air-stream by a mechanism which extends from the 
lungs to the mouth and nose. The description of any sound necessitates 
the provision of certain basic information: — 

1. The nature of the aig-stream; usually, this will be expelled by 
direct action of the lungs, but we shall later consider cases where this 1s 
not so. It may also be relevant to assess the force of exhalation. 

2. The action of the vocal folds; in particular, whether they are 
closed, wide apart, or vibrating. 

3. The position of the soft palate, which will decide whether or not 
the sound has nasal resonances. 

4. The disposition of the various movable organs of the mouth, 1.e. 
the shape of the lips and tongue, in order to determine the nature of the 
related oral and upper pharyngeal cavities. 

In addition, it may be necessary to provide other information con- 
cerning, for instance, a particular secondary stricture or tenseness 
which may accompany the primary articulation; or again, when it is a 
question of asound with no steady state to describe, an indication of the 
kind of movement which is taking place. 


3 


The Sounds of Speech— The 
Acoustic and Auditory Aspects 


3.1 Sound Quality 


To complete an act of communication, it is not normally sufficient that 
our speech mechanism should simply function in such a way as to pro- 
duce sounds; these in turn must be received by a hearing mechanism and 
interpreted, after having been transmitted through a medium, such as 
the air, which is capable of conveying sounds. We must now, therefore, 
examine briefly the nature of the sounds which we hear, the characteris- 
tics of the transmission phase of these sounds, and the way in which 
these sounds are perceived by a listener. 

3.1.1 When we listen to a continuous utterance, we perceive an 
everchanging pattern of sound. As we have seen, when it is a question of 
our own language, we are not conscious of all the complexities of 
pattern which reach our ears: we tend consciously to perceive and inter- 
pret only those sound features which are relevant to the intelligibility of 
our language. Nevertheless, despite this linguistic selection which we 
ultimately make, we are aware that this changing pattern consists of 
variations of different kinds: of sound quality—we hear a variety of 
vowels and consonants; of pitch—we appreciate the melody, or intonc- 
tion, of the utterance; of /oudness—we will agree that some sounds or 
syllables sound ‘louder’ than others; and of /ength—some sounds will 
be appreciably longer to our ears than others. These are judgments 
made by a listener in respect of asound continuum emitted by a speaker 
and, if the sound stimulus from the speaker and response from the 
listener are made in terms of the same linguistic system, then the 
utterance will be meaningful for speaker and listener alike. It is reason- 
able to assume, therefore, that there is some constant relationship 
between the speaker’s articulation and the listener’s reception of sound 
variations. In other words, it should be possible to link through the 
transmission phase the listener’s impressions of changes of quality, 
pitch, loudness, and length to some articulatory activity on the part of 
the speaker. It will in fact be seen that an exact parallelism or correla- 
tion between the production, transmission, and reception phases of 
speech is not always easy to establish, the investigation of such relation- 
ships being one of the tasks of present-day phonetic studies. 
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3.1.2 The formation of any sound requires that a vibrating medium 
should be set in motion by some kind of energy. We have seen that in the 
case of the human speech mechanism the function of vibrator 1s often 
fulfilled by the vocal folds and that these are activated by air pressure 
from the lungs. In addition, any such sound produced in the larynx is 
modified by the resonating chambers of the pharynx, mouth and, in 
certain cases, the nasal cavity. The listener’s impression of sound 
quality will be determined by the way in which the speaker’s vibrator 
and resonators function together. 

3.1.3 Speech sounds, like other sounds, are conveyed to our ears by 
means of waves of compression and rarefaction of the air particles (the 
commonest medium of communication). These variations in pressure, 
initiated by the action of the vibrator, are propagated in all directions 
from the source, the air particles themselves vibrating at the same rate 
(or frequency) as the original vibrator. In speech, these vibrations may 
be of a complex but regular pattern, producing ‘tone’ such as may be 
heard in a vowel sound; or they may be of an irregular kind, producing 
‘noise’, such as we have in the consonant s; or there may be both regular 
and irregular vibrations present, i.e. a combination of tone and noise, 
as in z. In the production of normal vowels, the vibrator is normally 
provided by the vocal folds; in the case of many consonant articula- 
tions, however, a source of air disturbance is provided by constriction 
at a point above the larynx, with or without accompanying vocal fold 
vibrations. 

Despite the fact that the basis of all normal vowels is the glottal tone, 
we are all capable of distinguishing a large number of vowel qualities. 
Yet there is no reason to suppose that the glottal vibrations in the case of 
[a: ] are very different from those for [i:], when both vowels are said 
with the same pitch. The modifications in quality which we perceive are 
due to the action of the supra-glottal resonators which we have pre- 
viously described. To understand this action, it is necessary to consider 
a little more closely the nature of the glottal vibrations. 

3.1.4 Ithas already been mentioned that the glottal tone is the result 
of a complex, but mainly regular, vibratory motion. In fact, the vocal 
folds vibrate in such a way as to produce, in addition to a basic vibra- 
tion over their whole length (the fundamental frequency), a number of 
overtones or harmonics having frequencies which are simple multiples 
of the fundamental or first harmonic. Thus, if there is a fundamental 
frequency of vibration of 100 cycles per second (cps), the upper har- 
monics will be of the order of 200, 300, 400, etc., cps. Indeed, there may 
be no energy at the fundamental frequency, but merely the harmonics 
of higher frequency such as 200, 300, 400 cps. Nevertheless, we still 
perceive a pitch which is appropriate to a fundamental frequency of 
100 cps, 1.e. the fundamental frequency is the highest common factor 
of all the frequencies present, whether or not it is present itself. 
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The number and strength of the component frequencies of this com- 
plex glottal tone will differ from one individual to another and this 
accounts at least in part for the differences of voice quality by which we 
are able to recognize speakers. But we can all modify the glottal tone so 
as to produce at will vowels as different as [i: ] and [a: ], so that despite 
our divergences of voice quality we can convey the distinction between 
two words such as key and car. This variation of quality, or timbre, of 
the glottal tone is achieved by the shapes which we give the resonators 
above the larynx—the pharynx, mouth, and nasal cavity. These 
chambers are capable of assuming an infinite number of shapes, each of 
which will have a characteristic vibrating resonance of its own. Those 
harmonics of the glottal tone which coincide with the chamber’s own 
resonance are very considerably amplified. Thus, certain bands of 
strongly reinforced harmonics are characteristic of a particular 
arrangement of the resonating chambers which produces, for instance, 
a certain vowel sound. Moreover, these bands of frequencies will be 
reinforced whatever the fundamental frequency. In other words, what- 
ever the pitch on which we say, for instance, the vowel [a: ], the shaping 
of the resonators and their resonances will be very much the same, so 
that it is still possible, except on extremely high or low pitches, to 
recognize the quality intended. It 1s found that the vowel [a:] has one 
such characteristic band of strong components in the region of 800 cps 
and another at about 1,100 cps. The vowel [i: ] has bands of energy at 
about 280 and 2,500 cps. 


3.2 Acoustic Spectrum 


This complex range of frequencies of varying intensity which go to 
make up the quality of a sound is known as the acoustic spectrum; those 
bands of energy which are characteristic of a particular sound are 
known as the sound’s formants. Thus, formants of [a:] are said to 
occur in the region 800 and 1,100 cps. Such a complex wave pattern may 
be analysed by means of a number of instrumental techniques, most of 
which involve lengthy calculations; e.g. the number of fundamental 
vibrations may be counted on an oscillographic tracing. But the instru- 
ment known as the sound spectrograph, originally designed to render 
speech patterns visible to the deaf, makes possible a relatively rapid and 
visual presentation of the spectrum and the various intensities of sound 
contained in it. In an instrument of this sort in common use, a number 
of filters, covering a range of frequencies from 0—8,000 cps, respond to 
the varying sound intensities at different frequencies and ultimately 
produce a length of paper giving a three-dimensional display of the 
acoustic spectrum: frequency is shown on the vertical axis, time on 
the horizontal, and the energy at any frequency level by means of the 
degree of blackening made by the tracing pen on the paper. Thus, the 
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FIG. 3. Spectrograms of /i:/, ,'a:', tar, /s/, "Ji, and W’e ll take a dozen. 
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concentrations of energy at particular frequency bands (the formants) 
stand out very clearly (see Fig. 3 for the spectrogram of the vowels [a: ] 
and [1:]). Moreover, since the machine permits the analysis of an utter- 
ance lasting up to about 2+ seconds, the everchanging pattern of the 
spectrum of continuous speech (see Fig. 3) and the difficulty of 
dividing up an utterance into separate (discrete) segments are made 
obvious. Such spectrographic analysis provides a great deal of acoustic 
information in a convenient form and, in addition, the instrument itself 
is relatively simple for a phonetician without special training to operate. 
Nevertheless, much of the information given is, in fact, irrelevant to 
our understanding of speech and the phonetician is obliged to establish 
by other methods the elements of the spectrum which are essential to 
speech communication. 

Although there remain a number of problems to be solved by spectro- 
graphic analysis, much is already known about the acoustic structure of 
vowels. For instance, two, or at the most three, formants appear to be 
sufficient for their correct identification. As far as the English vowels 
are concerned, the first three formants are all included in the frequency 
range 0—4,000 cps, so that the spectrum above 4,000 cps would appear 
to be largely irrelevant to the recognition of our vowels. It is true that on 
a telephone system, which may have a frequency range of about 
300—3,000 cps, we find little difficulty in identifying the sound patterns 
used by a Speaker and are even able to recognize voice qualities. Indeed, 
when we are dealing with a complete utterance in a given context, 
where, as we have seen above (pp. 5—6), there is a multiplicity of cues 
to help our understanding, a high degree of intelligibility may be 
retained even when there are no frequencies above 1,500 cps. 

As one would suspect, there appear to be certain relationships 
between the formants of vowels and the cavities of the vocal tract (i.e. 
the shapes taken on by the resonators, notably the relation of the oral 
and pharyngeal cavities). Thus, the Ist formant appears to be low when 
the tongue is high in the mouth: e.g. [i:] and [u:], having high tongue 
positions, have Ist formants of the order of 280—300 cps, whereas [a: ] 
and [p] have their Ist formants in the,region 600—800 cps, their tongue 
positions being relatively low. On the other hand, the 2nd formant 
seems to be inversely related to the length of the front cavity: thus [1:], 
where the tongue Is raised high in the front of the mouth, has a 2nd 
formant at about 2,500 cps, whereas [u:], where the tongue is raised at 
the back of the mouth and lips are rounded, has a relatively low 2nd 
formant around 900 cps. 

It is also confirmed from spectrographic analysis that a diphthong, 
such as that in my, is indeed a glide between two vowel elements (besides 
involving a perceptible articulatory movement), since the formants 
bend from those positions typical of one vowel to those characteristic of 
another (see Fig. 3). 
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For many consonant articulations (e.g. the initial sounds in pin, 
tin, kin, thin, fin, sin, shin, in which the glottal vibrations play no 
part) there is an essential noise component, deriving from an obstruc- 
tion or constriction within the mouth, approximately within the range 
2,000—8,000 cps (see Fig. 3). This noise component is also present in 
analogous articulations in which the vocal note is present, as in the final 
sounds of ruse and rouge where, as we have seen, we are dealing with 
sounds which consist of a combination of glottal tone and noise. Rele- 
vant acoustic data concerning both vowel and consonant articulations 
will be given in the sections dealing with individual English sounds 
(Chapters 7 and 8). 

Spectrographic analysis also reveals the way in which there tends, on 
the acoustic level, to be a merging of features of units which, lin- 
guistically, we treat separately. Thus, our discrimination of [f] and [8] 
sounds would appear to depend not only on the frequency and duration 
of the noise component but also upon a characteristic bending of the 
formants of the adjacent vowel. Indeed, 1n the case of such consonants 
as [p, t, k], which involve a complete obstruction of the air-stream and 
whose release is characterized acoustically by a relatively brief burst of 
noise, the vowel transition between the noise and the steady state of the 
vowel appears to be of prime importance for our recognition of the con- 
sonant. This overlapping of vowels and consonants would suggest that 
an analysis of speech based solely upon acoustic data would find it con- 
venient to operate with units larger than the sound segment. 

3.2.1 Our perception of the pitch of a speech sound depends 
directly upon the frequency of vibration of the vocal folds. Thus, we are 
normally conscious of the pitch caused by the ‘voiced’ sounds, espe- 
clally vowels; pitch judgments made on voiceless or whispered sounds, 
without the glottal tone, are limited in comparison with those made on 
voiced sounds, and are induced mainly by variations of intensity or by 
the dominance of certain harmonics brought about by the dispositions 
of the resonating cavities. 

The higher the glottal fundamental frequency, the higher our impres- 
sion of pitch. A male voice may have an average pitch level of about 
150 cps and a female voice a level in the region of 240 cps. The pitch 
level of voices, however, will vary a great deal between individuals and 
also within the speech of one speaker, the total range of a speaking 
voice being liable to have a range as extensive as 80—350 cps. Yet our 
perception of pitch change extends further than the limits of glottal 
fundamental frequency, since our recognition of quality depends upon 
frequencies of a much higher order. In fact, the human ear perceives 
frequencies from as low as 16 cps to about 20,000 and in some cases 
even higher. As one becomes older, this upper limit may fall consider- 
ably, so that at the age of fifty it may extend no higher than about 
10,000 cps. As we have seen, such a reduced range is no impediment to 
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perfect understanding of speech, since a high percentage of acoustic 
cues for speech recognition fall within the range 0—4,000 cps. 

Our perception of pitch is not, however, solely dependent upon 
fundamental frequency. Variations of intensity on the same frequency 
may induce impressions of a change of pitch; and again, tones of very 
high or very low frequency, 1f they are to be audible at all, require 
greater intensity than those in a middle range of frequencies. 

As far as the variation of pitch in speech is concerned, it is by no 
means easy to measure the continuous changes. lt is, of course, possible 
to count the fundamental vibrations within a selected segment in a trace 
such as is provided by an oscillograph. But, besides being a laborious 
process, such a method is likely to give us a pattern which is much more 
complicated than our auditory impression of pitch change. Our 
impression of pitch changes in speech results largely from fundamental 
frequencies carried by vowels and other voiced sounds, the voiceless 
sounds being discounted. With pitch, as with quality, an instrumental 
analysis does not make this human linguistic discrimination and may, 
therefore, provide us with a good deal of irrelevant information. 
Several instrumental procedures do, however, exist which go far 
towards showing us the grosser pattern which is relevant to linguistic 
work, e.g. a general pitch curve can be seen by tracing a harmonic on an 
expanded spectrographic display. The second harmonic is a convenient 
one to choose, since this 1s low enough usually not to be obscured by 
formant intensity nor very much deformed by formant transitions. 

Since then, a technique making use of the /aryngograph (Abberton 
and Fourcin, 1983) has been developed which provides immediate 
visual feedback of fundamental frequency contours which correspond 
essentially to our impression of a pitch pattern. 

3.2.2 Our sensation of the relative loudness of sounds may depend 
on several factors. A sound or syllable may appear to stand out from its 
neighbours—be ‘louder’ —because a marked pitch change is associated 
with it or because it is longer than its neighbours. It is better to use a 
term such as prominence to cover these general listener-impressions of 
variations in the perceptibility of sounds. More strictly, what is ‘loud- 
ness’ at the receiving end should be related to intensity at the production 
Stage, which in turn is related to the size or amplitude of the vibration 
and the speaker’s feeling for ‘stress’. Generally speaking, it may be said 
that an increase in amplitude of vibration, with its resultant impression 
of greater loudness, is brought about by an increase of air-pressure 
from the lungs. When a sound or syllable is stressed, it is being uttered 
with more muscular effort, increased air pressure, and greater ampli- 
tude of vibration. As we shall see (§9.2), this greater intensity is not in 
itself usually the most important factor in rendering a sound prominent 
in English. Moreover, all other things being equal, some sounds appear 
by their nature to be louder or more sonorous than others: e.g. the 
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vowel in barn has more carrying power than that in bean, and vowels 
generally are more powerful than consonants. 

The judgments we make concerning loudness are not as fine as those 
made for either quality or pitch. We may judge which of two sounds is 
the louder, but we find it difficult to express the extent of the difference. 
Indeed, in terms of our linguistic system, we need perceive and interpret 
only gross differences of loudness, despite the fact that when we judge 
quality we are, in recognizing the formant structure of a sound, reacting 
to characteristic regions of strong intensity in the spectrum. 

3.2.3 In addition to affording different auditory impressions of 
quality, pitch, and loudness, sounds may appear to a listener to be of 
different /ength. Clearly, whenever it is possible to establish the 
boundaries of sounds or syllables, it will be possible to measure their 
duration by means of such traces as are provided by oscillograms or 
spectrograms. Such delimitation of units, in both the articulatory and 
acoustic sense, may be difficult, as we shall see when we deal with the 
segmentation of the utterance. But, even when it can be done, varia- 
tions of duration in acoustic terms may not correspond to our linguistic 
judgments of length. We shall, for instance, refer later to the ‘long’ 
vowels of English such as those of bean and barn, as compared with the 
‘short’ vowel in bin. But, in making such statements, we shall not be 
referring to absolute duration values, since the vowel of barn may in 
fact, in any given utterance, be very considerably longer than that of 
bean. In the English system, however, we know that only two degrees of 
length are linguistically significant and all absolute durations will be 
interpreted in terms of this relationship. This distinction between 
measurable duration and linguistic length provides another example of 
the way in which our linguistic sense interprets from the acoustic 
material only that which is significant. 

The sounds comprising any utterance will have varying durations 
and, notably in English, we will have the impression that some syllables 
are longer than others. Such variations of length within the utterance 
constitute one manifestation of the dynamic rhythmic delivery which is 
characteristic of English and so is fundamentally different from the 
flow of other languages, such as French, where syllables tend to be of 
much more even length. 

The absolute duration of sounds or syllables will, of course, depend 
upon the speed of utterance. An average rate of delivery might contain 
anything from about 6 to 20 sounds per second, but lower and much 
higher speeds are frequently used without loss of intelligibility. The 
time required for the recognition of a sound will depend upon the 
nature of the sound and the pitch, vowels and consonants differing con- 
siderably in this respect, but it seems that a vowel lasting only about 
4 msecs may have a good chance of being recognized. 
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3.3 Hearing 


Our hearing mechanism must be thought of in two ways: the physiolog- 
ical mechanism which reacts to the acoustic stimuli—the varying pres- 
sures in the air which constitute sound; and the psychological activity 
which, at the level of the brain, selects from the gross acoustic informa- 
tion that which is relevant in terms of the linguistic system involved. In 
this way, measurably different acoustic stimuli may be interpreted as 
being the ‘same’ sound unit. As we have seen, only part of the total 
acoustic information seems to be necessary for the perception of 
particular sound values. One of the tasks which confront the phoneti- 
clan at the moment is the disentanglement of these relevant features 
from the mass of acoustic material, such as modern methods of sound 
analysis make available. The most fruitful technique of discovering the 
significant acoustic cues is that of speech synthesis, controlled by 
listeners’ judgments. After all, the sounds [a:] and [s] are [a: ] and [s] 
only if listeners recognize them as such. Thus, it has been established 
that only two formants are necessary for the recognition of vowels, 
because machines which generate sound of the appropriate frequency 
bands and intensity produce vowels which are correctly identified by 
listeners. 

Listeners, without any phonetic training, can, therefore, frequently 
give valuable guidance by their judgments of synthetic qualities. But it 
is important to be aware of the limitations of such listeners, so as to be 
able to make a proper evaluation of their judgments. A listener’s 
reactions are normally conditioned by his experience of handling his 
own language. Thus, if there are only five significant vowel units in his 
language, he is liable to allow a great deal more latitude in his assess- 
ment of whaf is the ‘same’ vowel sound than if he has twenty. An 
Englishman, for instance, having a complex vowel system and being 
accustomed to distinguishing such vowels as those in sit, set, sat, will be 
fairly precise in his judgment of an e type of vowel. A Spaniard, how- 
ever, whose vowel system is made up of fewer significant units, is likely 
for this reason to be disposed to be more tolerant of variation of 
quality, while still identifying a sound of the e type. Or again, if a 
listener is presented with a system of synthetic vowels which is 
numerically the same as his own, he is able to make allowance for con- 
siderable variations of quality between his and the synthetic system and 
still identify the vowels correctly—by their ‘place’ in a system rather 
than by their precise quality; this is what he does when he listens to and 
understands his language as used by a speaker of a different dialect. 
Tests involving synthetic speech and concerned with the recognition of 
absolute vowel qualities would have, therefore, to be presented to a 
listener in such a way that his own linguistic prejudices have the least 
possible influence on his judgments. 
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Our hearing mechanism also plays an important part 1n monitoring 
our own speech; it places a control upon our speech production which is 
complementary to our motor, articulatory habits. If this ‘feed-back’ 
control is disturbed, e.g. by the imposition of an artificial delay upon 
our reception of our own speech, disturbance in the production of our 
utterance is likely to result. Those who are born deaf or who become 
deaf before the acquisition of speech habits are rarely able to learn 
normal speech; similarly, a severe hearing loss later in life is likely to 
lead eventually to a deterioration of speech. 


4 


Description and 
Classification of Speech 
Sounds 


4.1 Phonetic Description 


We have considered briefly both the mechanism which produces speech 
sounds and also some of the acoustic and auditory characteristics of the 
sounds themselves. It is now important to formulate a method of 
description and classification of the sound types which occur in speech 
and, more particularly, in English. We have seen that a speech sound 
has at least three stages available for investigations—the production, 
transmission, and reception stages. A complete description of a sound 
should, therefore, include information concerning all three stages. To 
describe the first sound in the word fen merely in terms of the move- 
ments of the organs of speech is to ignore the nature of the sound which 
is produced and the features perceived by a listener. Nevertheless, to 
provide all the information in respect of all phases entails a lengthy 
description, much of which may be irrelevant to a particular purpose. 
For example, since the description of the sounds of a language has in the 
past been most commonly used in the teaching of the language to 
foreigners, the emphasis has always been laid on the articulatory event. 
Moreover, it is only comparatively recently that there has existed any 
considerable body of acoustic information concerning speech. In the 
future, it is likely that acoustic analysis will provide a general descrip- 
tive framework applicable to all speech sounds. But, at the moment, the 
most convenient and brief descriptive technique relies either on 
articulatory criteria or on auditory judgments, or on a combination of 
both. Thus, those sounds which are commonly known as ‘consonants’ 
are most easily described mainly in terms of their articulation, whereas 
‘vowel’ sounds require for their description a predominance of 
auditory impressions. 


4.2 Vowel and Consonant 


The terms ‘vowel’ and ‘consonant’ have been in use for many centuries. 
But it is clear that, if we are to distinguish between a purely phonetic 
investigation of concrete, measurable material and a linguistic analysis 
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in terms of abstractions from this material, any attempt to use ‘vowel’ 
and ‘consonant’ to cover categories at both levels may well be 
embarrassing, as will be shown in the next chapter when we consider 
linguistic analysis. In the past, when such a distinction was not 
generally made and the terms served to denote sometimes phonetic, 
sometimes linguistic categories, vowel and consonant tended to have a 
predominantly linguistic significance, ‘vowel’ having typically a central 
syllabic function, whereas ‘consonant’ was marginal in the syllable. 
When vowel and consonant have been defined phonetically, the 
criterion of distinction has generally been one of stricture, i.e. the 
articulation of vowels is not accompanied by any closure or narrowing 
in the speech tract which would prevent the escape of the air stream 
through the mouth or give rise to audible friction; all other sounds 
(necessitating a closure or a narrowing which involves friction) are con- 
sonants. Such a definition entails difficulty with such sounds as 
Southern British [l, r], which are traditionally consonants but which 
are more characteristically vowel-like in the terms of the definition; the 
difficulty has sometimes been avoided by calling such sounds semi- 
vowels or semi-consonants. It is clearly helpful if vowel and consonant 
are kept as terms to denote linguistic categories, whereas, if a phonetic 
division of sound types is required, other terms should be used to name 
them (cf. the use by K. L. Pike! of vocoid/contoid for the phonetic 
types and vowel/consonant for the linguistic categories). 

From the practical phonetic standpoint, it is convenient to distin- 
guish two types of speech sound, simply because the majority of sounds 
may be described and classified most appropriately according to one of 
two techniques: — 

1. the type of sound which is most easily described in terms of 
articulation, since we can generally feel the contacts and move- 
ments involved. Such sounds may be produced with or without vocal 
fold vibration (voice) and very often have a ‘noise’ component 
in the acoustic sense; these sounds fall generally into the traditional 
category of consonants and will be known here as the consonantal 
type. 

2. the type of sound, depending largely on very slight variations of 
tongue position, which is most easily described in terms of auditory 
relationships, since there are not contacts or strictures which we can feel 
with any precision. Such sounds are generally voiced, having no noise 
component but rather a characteristic patterning of formants; these 
sounds fall generally into the traditional category of vowels and will be 
known here as the vowel type. 

We shall be able to assign the sounds of English to one or other of 
these phonetic classes but, as we shall see (Chapter 5), the linguistic 


| As, for instance, in his Phonemics, University of Michigan, 1947. 
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categorization of sound units will not always correspond to the 
phonetic one. 


4.3 The Consonantal Type (described in mainly articulatory terms) 


We have seen, in the preceding chapters, that the production of a speech 
sound may involve the action of a source of energy, a vibrator, and the 
movement of certain supra-glottal organs. In the case of consonantal 
articulations, a description must provide answers to the following 
questions: — 

(1) Is the air-stream set in motion by the lungs or by some other 
means? (pulmonic or non-pulmonic). 

(2) Is the air-stream forced outwards or sucked inwards? (egressive 
or ingressive). 

(3) Do the vocal folds vibrate or not? (voiced or voiceless). 

(4) Is the soft palate raised, directing the air-stream through the 
mouth, or lowered, allowing the passage of air through the nose? (oral, 
or nasal or nasalized). 

(5) At what point or points and between what organs does the 
closure or narrowing take place? (place of articulation). 

(6) What is the type of closure or narrowing at the point of articula- 
tion? (manner of articulation). 

In the case of the sound [z], occurring medially in the word easy, the 
following answers would be given:— 

(1) pulmonic; (2) egressive; (3) voiced; (4) oral; (5) tongue 
blade—alveolar; (6) fricative. 
These answers provide a concise phonetic label for the sound; a more 
detailed description would include additional information concerning, 
for instance, the shape of the remainder of the tongue, the relative 
position of the jaws, the lip position, etc. 


4.3.1 Egressive Pulmonic Consonantal Sounds 


Most speech sounds, and all normal English sounds, are made with 
egressive lung air. At any point of articulation, a consonantal articula- 
tion may be voiceless or voiced. 


4.3.2 Place of Articulation 


The chief points of articulation, with special reference to the sounds of 
English, are the following:— 

Bilabial.—The two lips are the primary articulators, e.g. [p, b, m}. 

Labio-dental.—The lower lip articulates with the upper teeth, e.g. 
py]: 

Dental. —The tongue tip and rims articulate with the upper teeth, e.g. 
[0, ð]. 
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Alveolar.—The blade, or tip and blade, of the tongue articulates with 
the alveolar ridge, e.g. [t, d, 1, n, s, z]. 

Post-alveolar.—The tip (and rims) of the tongue articulate with the 
rear part of the alveolar ridge, e.g. [1, ta, da]. 

Retroflex.—The tip of the tongue is curled back to articulate with the 
part of the hard palate immediately behind the alveolar ridge, e.g. a 
retracted [1] such as is found in South-West British and American 
English. 

Palato-alveolar.—The blade, or the tip and blade, of the tongue arti- 
culates with the alveolar ridge and there is at the same time a raising of 
the front of the tongue towards the hard palate, e.g. [f, 3, tf, d3]. 

Palatal.—The front of the tongue articulates with the hard palate, 
e.g. [J] or [ç] as in queue [kju:] or [kcu:], or a very advanced type of 
[k, g] = [c, J]. i 

Velar.—The back of the tongue articulates with the soft palate, e.g. 
[k, 9, 9]. 

Uvular.—The back of the tongue articulates with the uvula, e.g. 
(K, R]. 

Glottal.—An obstruction, or a narrowing causing friction but not 
vibration, between the vocal folds, e.g. [?, h]. 

In the case of some consonantal sounds, there may be a secondary 
place of articulation in addition to the primary. Thus, in the so-called 
‘dark’ [t], in addition to the partial alveolar contact, there is an essential 
raising of the back of the tongue towards the velum (velarization); or, 
again, some post-alveolar articulations of [1] are accompanied by slight 
lip-rounding (labialization). The place of primary articulation is that of 
the greatest stricture, that which gives rise to the greatest obstruction to 
the airflow. The secondary articulation exhibits a stricture of lesser 
rank. Where there are two co-extensive strictures of equal rank an 
example of double articulation results. (See $8.9.2.) 


4.3.3 Manner of Articulation 


The obstruction made by the organs may be total, intermittent, partial, 
or may merely constitute a narrowing sufficient to cause friction. The 
chief types of articulation, in decreasing degrees of closure, are as 
follows: — 


(1) Complete Closure 


Plosive.—A complete closure at some point in the vocal tract, behind 
which the air pressure builds up and can be released explosively, e.g. 
[Debs tud. Oe 

Affricate.—A complete closure at some point in the mouth, behind 
which the air pressure builds up; the separation of the organs is slow 
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compared with that of a plosive, so that friction is a characteristic 
second element of the sound, e.g. [tf, d3, tu, du]. 

Nasal.—A complete closure at some point in the mouth but, the soft 
palate being lowered, the air escapes through the nose. These sounds 
are continuants and, in the voiced form, have no noise component; they 
are, to this extent, vowel-like (see §4.3.5), e.g. [m, n, p]. 


(2) Intermittent Closure 


Trill or Roll.—A series of rapid intermittent closures or taps made bya 
flexible organ on a firmer surface, e.g. [r], where the tongue tip taps 
against the alveolar ridge, or [R] where the uvula taps against the back 
of tongue. 

Tap.—A single tap made by a flexible organ on a firmer surface, e.g. 
[c] where the tongue tip taps once against the teeth ridge. 


(3) Partial Closure (i.e. median) 


Lateral.—A partial (but firm) closure is made at some point in the 
mouth, the air-stream being allowed to escape on one or both sides of 
the contact. These sounds may be continuant and non-fricative and 
therefore vowel-like, as in [l, t], or they may be accompanied by a little 
friction [l] or by considerable friction [f]. 


(4) Narrowing 


Fricative.—Two organs approximate to such an extent that the air- 
stream passes between them with friction, e.g. [f, v, 9, 6, s, Z, f, 3, Ç, X, 
h]. In the bilabial region, a distinction is to be made between those 
purely bilabial such as [0, B] where the friction occurs between spread 
lips, and a labial-velar sound like [m] where the friction occurs between 
rounded lips and is accompanied by a characteristic modification of the 
mouth cavity brought about by the raising of the back of the tongue 
towards the velum. ; 


4.3.4 Approximants 


Approximants (or frictionless continuants), such as the common 
variety of Southern British [1], have neither the closure nor the noise 
component characteristic of consonantal articulations; they are, 
however, frequently variants of consonantal types, as well as having the 
functional status of consonants and may therefore be included under 
this heading. 

[w] and [j] (also referred to as semi-vowels) are usually included in the 
consonantal category on functional grounds, but from the point of 
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view of phonetic description they are more properly treated as vowel 
glides. 


4.3.5 Obstruents and Sonorants 


It is sometimes found useful to classify categories of sounds-according 
to their noise component. Those in whose production the constriction 
impeding the airflow through the vocal tract is sufficient to cause noise 
are known as obstruents. This category comprises plosives, fricatives 
and affricates. Sonorants are those voiced sounds in which there is no 
noise component (i.e. nasals, approximants and vowels). 


4.3.6 Fortis and Lenis 


A voiceless/voiced pair such as [s, z} are distinguished not only by the 
presence or absence of voice but also by the degree of breath and 
muscular effort involved in the articulation. Indeed, we shall see that on 
the linguistic level, in certain situations, the voice opposition may be 
lost, so that the energy of articulation becomes a significant factor. 
Those English consonants which are usually voiced tend to be arti- 
culated with relatively weak energy, whereas those which are always 
voiceless are relatively strong. Thus, it may be important to define [s], 
for instance, as strong or fortis and {z} as weak or lenis. But see $8.5 for 
a discussion of fortis and lenis as phonological categories. 


4.3.7 Classification of Egressive Pulmonic Consonantal 
Articulations 


The essential factors to be included in any classificatory chart refer 
to:— 


(1) The place of articulation. 

(2) The manner of articulation. 

(3) The presence or absence of voice. 
(4) The position of the soft palate. 


The customary chart is based on a vertical axis showing manner of 
articulation; a horizontal axis showing place of articulation; a pairing 
of consonantal types to show the voiceless (or fortis) variety on the left 
and the voiced (or lenis) variety on the right; and an extra column under 
‘manner’ to include the relatively few consonantal types which require 
the lowered position of the soft palate. The chart, shown in Fig. 4, 
includes the symbols of the sound types of British English. 
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4.3.8 Egressive Consonants made with the Glottalic (or 
Pharyngeal) Airstream Mechanism 


Speech sounds may be articulated without the passage of lung air into 
the upper speech tract. In the production of these sounds, known as 
ejective, the glottis is closed, so that lung air is contained beneath it. A 
closure or narrowing is made at some point above the glottis (the soft 
palate being raised) and the air between this point and the glottis is com- 
pressed by a general muscular constriction of the chamber and a raising 
of the larynx. Thus, a bilabial ejective plosive sound [p’] may be made 
by compressing the air in this way behind the lips. However, it is not 
only plosives which may be ejective; affricates and fricatives commonly 
have this type of compression in a number of languages, e.g. [ts’, tl’, s’, 
x’, etc.]. If the glottis is tightly closed, it follows that this type of 
articulation can apply only to voiceless sounds. [p’, t’, k’] occur some- 
times in final positions. These are not to be confused with the more 
common variants of final [p, t, k] which are frequently (e.g. in London 
English) replaced or reinforced by a glottal stop, e.g. the final sound in 
the word stop may be replaced by a glottal stop or have a glottal closure 
accompanying the bilabial one, but there is no compression between the 
glottal and bilabial closures. 


4.3.9 Sounds Made with an Ingressive Air-stream 


A common method of expressing surprise or pain involves the energetic 
inspiration of air accompanied by bilabial friction. It is also possible to 
produce sounds with an ingressive air-stream in the articulation of 
which a complete closure is made at some point in the mouth. Thus, a 
bilabial stop may be made, with the soft palate raised, as for [b]; but, 
instead of air pressure from the lungs being compressed behind the 
closure, the almost completely closed larynx is lowered so that the air in 
the mouth and pharyngeal cavities is rarefied. The result is that the 
outside air is sucked in once the mouth closure is released; at the same 
time, there is sufficient leakage of lung air through the glottis to 
produce voice. It will be seen that the resulting sound is made by means 
of a combined airstream mechanism, namely a pulmonic airstream in 
combination with ingressive glottalic (or pharyngeal) air. Such ingres- 
sive stops (generally voiced) are known as implosives and occur with 
bilabial [6], dental or alveolar [d], or velar [g] mouth closures. Though 
such sounds occur ina number of languages, sometimes in the speech of 
the deaf and in types of stammering, they are not found in normal 
English. 

Another set of sounds involving an ingressive air-stream, this time an 
ingressive velaric (or oral) airstream mechanism, is produced entirely 
by means of closures within the mouth cavity; normal breathing 
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through the nose may continue quite independently if the soft palate 
is lowered. Thus, the sound made to indicate irritation or sympathy 
(often written as ‘Tut-tut’) is articulated by means of a double closure, 
the back of the tongue against the velum and the tip, blade, and sides 
against the alveolar ridge and side teeth. The cavity contained within 
these closures is then enlarged mainly by tongue movement, so that the 
air is rarefied. The release of the forward closure causes the outer air to 
be sucked in; the release may be crisp in which case a sound of a plosive 
type is heard, or relatively slow, in which case an affricated sound is 
produced. These sounds are known as clicks, the one referred to above 
being a dental or alveolar click [1]. The sound made to encourage horses 
is a lateral click, 1.e. the air is sucked in by releasing one side of the 
tongue [s]. These clicks, extralinguistic in English, and several others 
occur as significant sounds in a number of languages.! 


4.4 The Vowel Type (described in mainly auditory terms) 


This category of sounds is normally made with a voiced egressive air- 
stream, without any closure or narrowing such as would result in the 
noise component characteristic of many consonantal sounds; more- 
over, the escape of the air is characteristically accomplished in an 
unimpeded way over the centre line of the tongue. We are now con- 
cerned with a glottal tone modified by the action of the upper 
resonators of the mouth, pharyngeal and nasal cavities. As we have 
seen (Chapters 2 and.3), the movable organs mainly responsible for 
shaping these resonators are the soft palate, lips, and tongue. A descrip- 
tion of vowel-like sounds must, therefore, note:— 


(1) the position of the soft palate—raised for oral vowels, lowered 
for nasalized vowels; 

(2) the kind of aperture formed by the lips—degrees of spreading 
or rounding; 

(3) the part of tongue which is raised and the degree of raising. 


Of these-three factors, only the second—the lip position— can be 
easily described by visual or tactile means. Our judgment of the action 
of the soft palate depends less on our feeling for its position than on our 
perception of the presence or absence of nasality in the sound 
produced. Again, the movements of the tongue, which so largely deter- 
mine the shape of the mouth and pharyngeal cavities, may be so minute 
that it is impossible to assess them by any simple means in terms of 
positions; moreover, there being normally no contact of the tongue 
with the roof of the mouth, no help is given by any tactile sensation. A 


1See pp. 340—1 for the Chart of the International Phonetic Alphabet which provides 
symbols to denote the sound types occurring in languages. 
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vowel description will usually, therefore, be based mainly on auditory 
judgments of sound relationships, together with some articulatory 
information, especially as regards the position of the lips. In addition, 
an acoustic description can be given in terms of the disposition of the 
characteristic formants of the sound. 


4.4.1 Difficulties of Description 


The description of vowel sounds, especially by means of the written 
word, has always presented considerable difficulty. Certain positions 
and gross movements of the tongue can be felt. We are, for instance, 
aware that when we pronounce most vowel sounds the tongue tip lies 
behind the lower teeth; moreover, in comparing two such vowels as [1i: ] 
(key) and [a: ] (car) (Fig. 5), we can feel that, in the case of the former, 
the front of the tongue 1s the part which is mainly raised, whereas, inthe 
case of the latter, such raising as there is is accomplished by the back 
part of the tongue. Therefore, it can be stated in articulatory terms that 
some vowel sounds require the raising of the front of the tongue, while 
others are articulated with a typical ‘hump’ at the back; and these state- 
ments can be confirmed by means of X-ray photography. But the actual 
point and degree of raising is more difficult to judge. It is not, for 
instance, helpful to say that a certain vowel is articulated with the front 
part of the tongue raised to within 5 mm of the hard palate. This may be 
a statement of fact for one person’s pronunciation, but an identical 
sound may be produced by another speaker with a different relation- 
ship between the tongue and palate. Moreover, we would not find it 
easy to judge whether our tongue was at 4or 5 mm from the palate. It is 
no more helpful to relate the vowel quality to a value used in a particular 
language, as is still so often done. A statement such as ‘a vowel quality 
similar to that in the English word cat’ is meaningless, since the vowel in 
cat may have a wide range of values in English. The statement becomes 







TAEI A 





FIG. 5.— Tongue positions of [i: ], [a: ]. 


Description and Classification of Speech Sounds 39 


more useful if the type of English is specified, but even then a number of 
variant interpretations will always be possible. 


4.4.2 The Cardinal Vowel Scale 


It is clear that a finer and more independent system of description is 
needed, on the auditory and articulatory levels. The most satisfactory 
scheme is that devised by Daniel Jones and known as the Cardinal 
Vowel system. The basis of the system is physiological, i.e. the two 
qualities upon which all the others were ‘hinged’ were produced with 
the tongue in certain easily felt positions: the front of the tongue raised 
as close as possible to the palate without friction being produced, for 
the cardinal vowel [1]; and the whole of the tongue as low as possible in 
the mouth, with very slight raising at the extreme back, for the cardinal 
vowel [a]. Starting from the [i] position, the front of the tongue was 
lowered gradually, the lips remaining spread or neutrally open and the 
soft palate raised. The lowering of the tongue was halted at three points 
at which the vowel qualities seemed, from an auditory standpoint, to be 
equi-distant. The tongue positions of these qualities were X-rayed and 
were found to be fairly equi-distant from a spatial point of view. The 
symbols [e, €, a] were assigned to these vowel values. The same pro- 
cedure was applied to vowel qualities depending on the height of the 
back of the tongue, this raising the back of the tongue from the [a] 
position; the lips were changed progressively from a wide open shape 
(for [a]) to a closely rounded one and the soft palate remained raised. 
Again, three auditorily equi-distant points were established from the 
lowest to the highest position; the corresponding tongue positions were 
photographed and the spatial relationships confirmed as for the front 
vowels. These values were given the symbols [9, o, u]. Thus, a scale of 
eight primary cardinal vowels was set up, denoted by the following 
numbers and symbols: 1, [1]; 2, [e]; 3, [e]; 4, [a]; 5, [a]; 6, [o]; 7, [o]; 
8, [u]. 

It is to be noticed that the front series [1, e, e, a] and [a] of the back 
series are pronounced with spread or open lips, whereas the remaining 
three members of the back series have varying degrees of lip-rounding. 
A secondary series can be obtained by reversing the lip position, e.g. 
close lip-rounding applied to the [i] tongue position, or lip spreading 
applied to the [u] position. Such a secondary series is denoted by the 
following numbers and symbols: 9, [y]; 10, [ø]; 11, [ce]; 12, [ce]; 13, [b]; 
14, [a]; 15, [x]; 16, [u]. In addition, a pair of cardinal vowels, 
unrounded and rounded, have been established, for which the centre of 
the tongue has the highest point of raising; these are: 17 (unrounded), 
[+]; 18 (rounded), [u]. This complete series of eighteen cardinal vowel 
values may be divided into two lip shape categories, with corresponding 
tongue positions:— 
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Unrounded—|1, €, €, a, a, A, Y, Ww, 1]. 
Rounded— [y, Ø, Œ, CE, D, 9, O, u, ul]. 


Such a scale is useful because (a) the vowel qualities are unrelated to 
particular values in languages, though many may occur in various 
languages, and (b) the set is recorded,' so that reference may always be 
made to a standard, invariable scale. Thus, a vowel quality might be 
described as being, for instance, similar to that of cardinal 2 ([e]), or 
another as being a type half-way between cardinal 6 ([9]) and cardinal 7 
({o]), but somewhat centralized. Diacritics are available to show modi- 
fications of cardinal values, e.g. a subscript hook , to mean more open, 
a subscript dot meaning closer, and raised dots ` to mean centralized.? 
The last example given above might in this way be symbolized as [5] or 
[6]. Other recorded cardinal values might be useful, especially for those 
types of vowel sound made by raising the centre part of the tongue, but 
the present scale 1s sufficient for most practical purposes. 

It is, moreover, possible to give a visual representation of these vowel 
relationships on a chart which is based on the cardinal vowel tongue 
positions. The simplified diagram shown in Fig. 6 is obtained by 
plotting the highest point of tongue raising for each of the primary 
cardinal vowels and joining the points together. The internal triangle, 
corresponding to the region of central or [a]-type vowel sounds, is made 
by dividing the top line into three approximately equal sections and 
drawing lines parallel to the two sides, so that they meet near the base of 





C [a] 


FIG. 6.—The primary Cardinal Vowels; the area symbolized by [5] or [6] shown 
as a black circle. 


l Two records (Eng. 252—3 and 254—5) are obtainable from the Linguaphone Institute. 

2For details of such diacritics and other modifiers, see The Principles of the Inter- 
national Phonetic Association, obtainable from the Secrctary of the Association, Depart- 
ment of Linguistics and Phonetics, University of Leeds. 
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the figure. On such a figure, the sound symbolized by [5] or [6] may 
have its relationship to the cardinal scale shown visually (see the black 
circle on Fig. 6). 

It must be understood that this diagram is a highly conventionalized 
one which shows, above all, quality relationships. Some attempt is, 
however, made to relate the shape of the figure to actual tongue 
positions: thus, the range of movement is greater at the top of the 
figure, and the tongue raising of front vowels becomes more retracted 
as the tongue position lowers. Nevertheless, it has been shown that it is 
possible to articulate vowel qualities without the tongue and lip 
positions which this diagram seems to postulate as necessary. It is, for 
instance, possible to produce a sound of the cardinal 7 ([o]) type with- 
out the lip-tongue relationship suggested. But, on the whole, it may be 
assumed that a certain auditorily identified vowel quality will be pro- 
duced by an articulation of the kind presupposed by the cardinal vowel] 
diagram. Moreover, it is a remarkable fact that the auditory judgments 
as to vowel relationships made by Daniel Jones half a century ago have 
been largely supported by recent acoustic analysis; in fact, a chart based 
on an acoustic analysis of cardinal vowel qualities corresponds very 
well with the traditional cardinal vowel figure. 


4.4.3 Nasality 


Besides the information concerning lip and tongue positions which the 
above chart and symbolization denote, a vowel description must also 
indicate whether the vowel is purely oral or whether it is nasalized. The 
eighteen cardinal vowels mentioned may all be transformed into their 
nasalized counterparts if the soft palate is lowered. It is not normally 
necessary, however, to have such an extensive series of nasalized 
cardinal vowels, since it is unusual (though not unknown) for languages 
to make such fine, significant distinctions of nasalized qualities as are 
common in the case of the purely oral values. 


4.4.4 Relatively Pure Vowels and Gliding Vowels 


It is clearly not possible for the quality of a vowel to remain absolutely 
constant (or, in other words, for the organs of speech to function for 
any length of time in an unchanging way). Nevertheless, we may distin- 
guish between those vowels which are relatively pure (or unchanging), 
such as the vowel in learn, and those which have a considerable and 
voluntary glide, such as the gliding vowel in /ine. The so-called pure 
vowels will be marked on the diagram as a dot, showing the highest 
point of the tongue or, better, a ring, since it would be inadvisable to 
attempt to be over-precise in the matter of these auditory judgments; 
the gliding (or diphthongal) vowel sound will be shown as an arrow, 
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lane 


FIG. 7.—The vowels of learn and line. 


which indicates the quality of the starting point and the direction in 
which the quality change is made (corresponding to a movement of the 
tongue). Fig. 7 shows the way in which the vowels of learn and line will 
be marked. 

We are now in a position to give a practical and comprehensive 
description of a vowel sound, partly in articulatory terms, partly in 
auditory terms. The vowel which we have symbolized above as [5] or [9] 
might be described in this way: ‘A vowel quality between cardinal 
vowels nos. 6 and 7, but having a somewhat centralized value; the lips 
are fairly closely rounded; and the soft palate is raised.’ Such a written 
description will have a meaning in terms of sound for anyone who 1s 
familiar with the cardinal vowel scale. There may, of course, be other 
features of the sound to mention, e.g. a breathy quality, or the fact that 
the vowel is articulated with the tongue tip retroflexed, rather than with 
the tongue tip in its more usual position behind the lower teeth. 


4.4.5 The Classification of Vowel Type Articulations 


In describing vowel sounds, we have taken into account especially the 
positions of the lips, tongue, and soft palate. A system of classification 
should include, but not confuse, all three factors. A distinction between 
description and classification was not always made and early attempts 
at systematic description often established only general classificatory 
categories. A typical seventeenth-century system is based on tongue 
positions, since the vertical axis of a two-dimensional chart deals with 
open, medium, and close types of articulation; but the horizontal 
plane, with its divisions of guttural, palatal, and labial, introduces a lip 
dimension which only confuses the categorization. As in the case of 
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Front Central Back 


Close 







Half-close 


Half-open 


Open 


FIG. 8.—Classification labels combined with the Cardinal Vowel diagram. 


consonants, it is, in fact, difficult to incorporate all factors in a two- 
dimensional chart and it is found most convenient to base classification 
on the tongue position. Our present system 1s not very different from 
that used tn the nineteenth century, when such divisions as Back, 
Mixed, and Front referred to the part of the tongue raised, and High, 
Mid, and Low to the degree of raising. To-day, we combine our classi- 
ficatory chart with the descriptive figure, naming those vowels in which 
the main raising is made by the front of the tongue towards the hard 
palate front vowels, those in which the back of the tongue is raised 
towards the soft palate back vowels, and those in which the centre is 
raised towards the juncture of the hard and soft palates central vowels. 
Instead of the threefold vertical division as regards degree of raising, it 
is customary to refer to four regions corresponding to the four cardinal 
degrees of raising; thus, the [i—u] level is known as the c/ose region; the 
[e—o] level as the half-close, the [e—9] level as the half-open, and the 
[a—a] level as the open (see Fig. 8). 

On a chart of this type, the factors of lip and soft palate positions 
have to be indicated separately. A vowel in the general region of 
cardinal 2 ([e]) would, therefore, be classified briefly as an oral, front, 
unrounded, half-close vowel. Very often, when it 1s a question simply 
of identifying the significant vowel units in terms of the system of a 
particular language, such a brief classificatory label will serve also as an 
adequate description. 


5 


Sounds in Language 


5.1 Speech Sounds and Linguistic Units 


We have now considered a method of describing and classifying the 
sounds capable of being produced by the speech organs. Speech is, 
however, a manifestation of language and spoken language is normally 
acontinuum of sound. A speech sound, produced in isolation and with- 
out the meaningfulness imposed by a linguistic system, may be 
described in purely phonetic terms; but any purely phonetic approach 
to the sounds of language encounters considerable difficulties. Two 
initial problems concern, firstly, the identification and delimitation of 
the sound unit (or segment) to be described and, secondly, the way in 
which different sounds are treated, for the purpose of linguistic 
analysis, as if they were the same. 

As we have seen, in any investigation of speech, it is on the physio- 
logical and acoustic levels that most information is available to us. Yet, 
especially on the articulatory level, as is revealed by moving X-ray 
photography, any utterance consists of apparently continuous move- 
ments by a very large number of organs; it is well-nigh impossible to 
say, simply from an X-ray film of the speech organs at work, how many 
speech sounds have been uttered. A display of acoustic information is 
easier to handle (see illustration on p. 22), but even here it is not always 
possible, because of the way in which many sounds merge into one 
another, to delimit exactly the beginning and end of sound segments. 
Moreover, even if it were possible to identify the main characteristics of 
certain sounds without being sure of their limits, it would not follow 
that the phonetic statement we might accordingly make concerning the 
sequence of sound segments would be a useful one in terms of the 
language which we were investigating. Thus, the word fot is frequently 
pronounced in the London region in such a way that it is possible to 
identify five sound segments: [t], [s], [h], [o], [t]. Yet much of this 
phonetic reality may be discarded as irrelevant when it is a question of 
the structure of the word fot in terms of the sound system of English. 
Indeed, the speaker himself will probably feel that the utterance tot 
consists of only three ‘sounds’ (not only because of the influence of the 
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spelling), such a judgment on his part being a highly sophisticated 
one which results from his experience in hearing and speaking 
English. In other words, the [s] and [h] segments are to be treated 
as part of the linguistic unit /t/.! The phonetic sequence [tsh] does 
not, in an initial position in this type of English, consist of three 
meaningful units; in other languages, on the other hand, such a 
sequence might well constitute three linguistic units as well as three 
phonetic segments. 

This same example illustrates how different sounds may count, in 
respect of their function in a language, as the same linguistic unit. In 
such a pronunciation of fof asis noted above, the first realization of /t/ 
might be described as consisting of:— 


(1) a voiceless stop made by the tongue tip and rims against the 
alveolar ridge and side teeth; 

(2) a slow release of the compressed air, so that friction is 
heard—TIs]; 

(3) the complete disengagement of the tongue from the roof of the 
mouth, so that no friction is caused in the mouth; but an interval 
before the beginning of the next sound, during which there is friction 
in the glottis (and voiceless resonance in the supra-glottal 
cavities)—[h]. 


The second manifestation of /t/, on the other hand, might have an 
articulation which could be described phonetically as follows:— 


(1) an alveolar stop made as before, but with a simultaneous stop 
made in the glottis; 

(2) the glottal closure is released, but the oral stop is retained 
Slightly longer, during which time the air escapes (and respira- 
tion continues) through the nasal passage, the soft palate being 
lowered. 


The first [t] might be briefly described as a voiceless alveolar plosive, 
released with affrication and aspiration; the second as an unexploded 
voiceless alveolar plosive made with,a simultaneous glottal stop. These 
two different articulations, with the resultant difference of sound, 
function, nevertheless, as the same linguistic unit, the first sound 
occurring predictably under strong stress initially in a syllable and the 
second being a typical manifestation of the unit in a final position. Such 
an abstract linguistic unit, which will include sounds of different types, 
is called a phoneme; the different phonetic realizations of a phoneme 
are known as its allophones. 


| Note that it is customary to distinguish sound segments from linguistic sound units 
(phonemes) by using [ ] to enclose the former and / / to enclose the latter. 
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5.2 The Linguistic Hierarchy 


It is clear, as we hinted in Chapter 1, that speech and language require 
in their analysis different orders of units. An utterance, on the concrete 
speech level, will consist of the continuous physiological activity which 
results in a continuum of sound; the largest unit will, therefore, be the 
span of sound occurring between two silences. Within this unit of vary- 
ing extent it may be possible to find smaller segments. It is, however, 
from the abstract, linguistic level of analysis that we receive guidance as 
to how the utterance may be usefully segmented in the case of any 
particular language. We might find, for instance, that an utterance such 
as ‘The boys ran quickly away and were soon out of sight’ is spoken 
without a pause or interruption for breath; it might be said to constitute 
a single breath group on the articulatory level. But, on the linguistic 
level, we know that this utterance is capable of being analysed as a 
sentence consisting of two clauses. Moreover, certain extensive 
sequences occurring within the utterance might be meaningfully 
replaced by other sound sequences, e.g. boys might be replaced by 
dogs, ran by walked, quickly by slowly, etc. These replaceable sound 
sequences are able to stand by themselves and are called words. In 
written forms of language, it usually happens that words are separated 
from each other by spaces, this being a sophisticated convention which 
is not reflected in speech. (We shall see, however, in Chapter 11, that 
words may retain at their boundaries certain characteristics in con- 
nected speech, so that their presence and span is signalled on a phonetic 
as well as a linguistic level.) Yet there are meaningful units smaller than 
the word. The word boys may be divided into boy and s ([z]), where the 
presence or absence of [z] indicates the plural or singular form; quickly 
may be said to consist of quick and the adverbial suffix /y. There are 
smaller sound sequences which may be interchanged meaningfully, but 
which may or may not be capable of standing by themselves. These 
smaller units, known as morphemes, may correspond with words (root 
morphemes), e.g. boy, in which case they may stand alone, or exist at a 
level lower than the word, e.g. [z], in which case they will not normally 
occur other than in association with a word. There is, however, yet a 
lower level at which meaningful commutation is possible. The word ran 
is also a morpheme; but, if instead of saying [reen] we say [ran], we 
have, by changing an element on a lower level than the morpheme, 
changed the meaning and function of the word. This basic linguistic 
element, beyond which it is not necessary to go for practical purposes, 1s 
what we have already referred to as a phoneme. A phoneme may, there- 
fore, be thought of as the smallest contrastive linguistic unit which may 
bring about a change of meaning. 
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5.3 The Phoneme, a Contrastive Unit 


It is possible to establish the phonemes of a language by means of a 
process of commutation or the discovery of minimal pairs, i.e. pairs of 
words which are different in respect of only one sound segment. The 
series of words pin, bin, tin, din, kin, chin, gin, fin, thin, sin, shin, win 
supplies us with twelve words which are distinguished simply by a 
change in the first (consonantal) element of the sound sequence. These 
elements of contrastive significance, or phonemes, we may symbolize 
as/p,b,t,d,k,tf,d3,f,0,s, f, w/. But other sound sequences will show 
other consonantal oppositions, e.g.:— 


(1) tame, dame, game, lame, maim, name, adding /g, 1, m, n/ to 
our inventory; 

(2) pot, tot, cot, lot, yacht, hot, rot, adding /j, h, r/; 

(3) pie, tie, buy, thigh, thy, vie, adding /Ó, v/; 

(4) two, do, who, woo, Zoo, adding /z/. 


Such comparative procedures reveal twenty-two consonantal 
phonemes capable of contrastive function initially in a word. 

§.3.1 It is not sufficient, however, to consider merely one position 
in the word. Possibilities of phonemic opposition have to be investi- 
gated in medial and final positions as well as in the initial. If this is done 
in English, we discover in medial positions another consonantal 
phoneme, /3/, cf. the word oppositions /etter, leather, leisure or seater, 
seeker, Caesar, seizure. This phoneme /3/ does not occur in initial 
positions and is rare (e.g. in rouge) in final positions. Moreover, in final 
positions, we do not find /h/ or /r/, and it is questionable whether we 
should consider /w, j/ as separate, final contrastive units (see §§7.2, 
7.5). We do, however, find one more phoneme that is common in 
medial and final positions but unknown initially, viz. /n/, cf. simmer, 
sinner, Singer or Some, son, sung. 

Such an analysis of the consonantal phonemes of English will give us 
a total of twenty-four phonemes, of which four (/h, r, 3, n/) are of 
restricted occurrence—or six, if /w, j/ are not admitted finally. Similar 
procedures may be used to establish the vowel phonemes of English (see 
Chapter 7). 

The final inventories of vowel and consonant phonemes will con- 
stitute a statement of the total oppositions in all positions in the word or 
syllable; when any particular place in the word or syllable is taken into 
consideration, the number of terms in the series of oppositions is likely 
to be more restricted. 


5.3.2 Diversity of Phonemic Solutions 


It is important to emphasize the fact that 1t is frequently possible to 
make several different statements of the phonemic structure of a 
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language, all of which may be equally valid from a logical standpoint. 
The solution chosen will be the one which is most convenient as regards 
the use to which the phonemic analysis is to be put. Thus, one solution 
might be appropriate when it is a question of teaching a language to a 
particular group of foreigners, when similarities and differences 
between two languages may need to be underlined; another solution 
might be appropriate if it is a question of using the phonemic analysis as 
a basis for an orthography, when social considerations (1.e., for 
example, relations of the people with other countries having particular 
orthographic conventions) have to be taken into account. Even without 
such considerations, discrepancies in analysis frequently arise in the 
case of such sound combinations as affricates (e.g. [tf, d3, tr, dr]) and 
diphthongs (e.g. [el, 9U, al, aU]), which may be treated as single 
phonemes or combinations of two. Such problems concerning English 
will be dealt with briefly when vowels and consonants are considered in 
detail. 


5.3.3 Relevant Features of Phonemes 


Up to now we have obtained an inventory of phonemes for English 
which is no more than a set of relationships of oppositions. The essence 
of the phoneme /p/, for instance, is that itis not /t/ or /k/ or /s/, etc. 
This is a negative definition, which it is desirable to amplify by means of 
positive information of a phonetic type. Thus, we may say that /p/ Is, 
from a phonetic point of view, characteristically fortis and voiceless 
(compared with such lenis, voiced sounds as /b/ or /d/); bilabial (com- 
pared with the places of articulation of such sounds as /t/ or /k/); oral 
(as compared with /m/ or /n/); a stop or plosive (compared with /1/ or 
/r/ or /f/). The /p/ phoneme may, therefore, be identified positively 
by stating the combination of features which are significant within the 
English system: fortis, voiceless, bilabial, oral stop. This is a label 
which will be applicable to the phoneme in most of its realizations 
(‘most’, because in such a word as cup-full/p/ may well be realized as a 
labio-dental stop. Such a case is covered if the term ‘labial’ is used 
rather than ‘bilabial’).! 


l [t is possible, by extracting from the phonemes of a language the distinctive features 
which combine to form such phonemes, to account more economically for the sound 
system in operation in terms of fundamental phonological elements. For an elaboration 
of such ‘distinctive feature’ analysis in articulatory and acoustic terms, see particularly R. 
Jakobson, C. G. M. Fant and M. Halle, Preliminaries to Speech Analysis, 4th ed., 
Cambridge, Mass., 1963; C. G. M. Fant, Theory of Distinctive Features, Speech Trans- 
mission Lab. K.T.H., 1966; N. Chomsky and M. Halle, The Sound Pattern of English, 
New York, 1968. 
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5.3.4 Allophonic Variants 


No two realizations of a phoneme (its allophones) are the same. This is 
true even when the same word is repeated; thus, when the word cat is 
said twice, there are likely to be slight phonetic variations in the two 
realizations of the phoneme sequence /k + æ + t/. Nevertheless, the 
phonetic similarities between the utterances will probably be more 
striking than the differences. Allophones of the same phoneme occur- 
ring in different words or in different situations in a word will, 
however, frequently show considerable phonetic differences. We have 
seen (§5.1) how different the initial and final allophones of /t/ in the 
word fot may be. Or again, the [k] sounds which occur initially in the 
words key and car are phonetically clearly different: the first can be felt 
to be a forward articulation, near the hard palate, whereas the second is 
made further back on the velum. This difference of articulation is 
brought about by the nature of the following vowel, [1:] having a more 
advanced articulation than [a:]; the allophonic variation is in this case 
conditioned by the context. The two [I] sounds of lull [lat] show a varia- 
tion of a different kind. The first [1], the so-called ‘clear’ [1] with a front 
vowel resonance, has a quality very different from that of the final [t], 
the ‘dark’ [t] with a back vowel resonance. Here, however, the dif- 
ference of quality is related to the position of the phoneme in the word 
or syllable and is not dependent upon the phonetic context, i.e. the 
adjacent sounds. It is possible, therefore, to predict in a given language 
which allophones‘of a phoneme will occur in any particular context or 
situation: they are said to have a complementary distribution, no two 
realizations of the same phoneme which have a markedly different 
phonetic quality occurring in the same situation. Such a generalization 
does not take into account those variant realizations of the same 
phoneme in the same situation which may constitute the difference 
between two utterances of tlle same word. When the same speaker 
produces slightly different pronunciations of the word cat, the different 
realizations of the phonemes are said to be ‘free variants’. But, if both 
[1] and [t] occurred in the same situation or context and had the function 
of distinguishing words, the two sotinds would have to be treated as 
realizations (allophones) of different phonemes. 

It is usually the case that there remains some phonetic similarity 
between the allophones of a phoneme: both the [1] sounds discussed 
above, as well as the voiceless fricative variety which follows /p/ or /k/ 
in words such as please and clean, are lateral articulations. But the 
Cockney realization of dark [tł] is often [o] (as in /itt/e [li?0]), in which 
case, the distribution of the sounds still being complementary, we have 
to envisage the possibility of allophones belonging to the vocalic and 
consonantal types or are obliged to reorganize our phonemic analysis. 
Again, it sometimes happens that two sounds occur in complementary 
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distribution, but are not treated as allophones of the same phoneme 
because of their total phonetic dissimilarity. This is the case of [h] and 
[n] in English; they are never significantly opposed, since [h] occurs 
typically in initial positions in the syllable or word, and [n] in final 
positions. A purely logical arrangement might include these two sounds 
within the same phoneme, so that Aung might be transcribed phone- 
mically as either /hah/ or /nan/; but such a solution would ignore the 
total lack of phonetic similarity and also the feeling of native speakers. 
The ordinary native speaker is, in fact, often unaware of the allophonic 
variations of his phonemes and will, for instance, say that the various [1] 
allophones we have discussed are the ‘same’ sound; [h] and [g], how- 
ever, he will always consider to be ‘different’ sounds. When he makes a 
statement of this kind, he is usually referring to the function of the 
sounds in the language system and can thereby offer helpful, intuitive 
information regarding the phonemic organization of his language, 
especially when it 1s a question of the first phonemic analysis of a 
language which has no written form. In the case of a language such as 
English, which already has an alphabetic orthography (however incon- 
sistent), prejudices induced by the existence of such a written form have 
naturally to be taken into account in evaluating the native speaker’s 
reaction. 


5.3.5 Neutralization of Phonemes 


It sometimes happens that a sound may be assigned to either of two 
phonemes with equal validity. In English, examples of this kind are to 
be found in the plosive series. The main contrastive feature between the 
pairs pin/bin, team/deem, come/gum resides in the presence of aspira- 
tion in /p, t, k/ and its absence in /b, d, g/, presence or absence of 
voice being usually irrelevant in this initial, accented position, where 
/b, d, g/ are largely or totally devoiced. When /p, t, k/ follow an 
initial /s/, however, they are realized with no aspiration even when 
stressed. Thus, in the case of such words as spin, steam, scum, we have 
three plosive phonemes which lack the aspiration often characteristic of 
/p, t, k/, but which do not have the voice which sometimes accom- 
panies /b, d, g/. Since /p, t, k/ are never opposed to /b, d, g/ follow- 
ing /s/ in this position, the words might, therefore, be transcribed 
phonemically either as /spin, sti:m, skam/ or as /sbin, sdi:m, sgam/ 
without ambiguity. ! 

Another case concerns the allophones of /m/ and /n/ before /f/ or 


'Experimental investigations suggest that perceptually and, to some extent, acous- 
tically /p, t, kZ following /s/ have more in common with initial /b, d, g/ than with 
initial /p, t, k/ (see N. Davidsen-Nielsen, ‘English Stops after initial /s/’, English 
Studies, no. 4, August, 1969). 
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/v/, in words like symphony and infant. The nasal consonant in each 
case is likely to be [m] in rapid speech, i.e. a labio-dental sound anti- 
cipating the labio-dental [f]. Here again, /m/ and /n/ are not opposed, 
so that the sound could be allocated to either the /m/ or the /n/ 
phoneme. In practice, since in a slow pronunciation the [m] sound 
would tend to be used in symphony and the [n] sound in infant, a 
phonemic differentiation is usually made. 


5.3.6 Conditions of Phonemic Analysis 


Statements concerning phonemic categories and allophonic variants 
can usually be made in respect of only one variety of one language. lt 
does not follow, because, for instance, [l] and [t] are not contrastive in 
English and belong to the same phoneme, that this is so in other 
languages—in some kinds of Polish [l] and [t] constitute separate 
phonemes. Or again, although /n/ is a phoneme in English, in Italian 
the velar nasal [n] is an allophone of /n/ which occurs before /k/ and 
/g/. Indeed, in English, too, /n/ has not always had phonemic status. 
Nowadays, [yn] might be considered an allophone of /n/ before /k/ and 
/g/, as in sink and finger, were it not for the fact that the /g/ in words 
such as sing was lost about 400 years ago; once this situation had arisen, 
a phonemic opposition existed between sin and sing. 

It should also be noted that the number of phonemes may differ as 
between various types of the same language. Thus, in present-day 
Southern British English, the words cat, half, cart contain the 
phonemes /ze/, /a:/, and /a:/ respectively. But another type of 
English may have only one vowel phoneme for all three words, the 
words being phonemically /kat, haf, kart/ (the post-vocalic [r] being 
pronounced). Such a dialect! of English would have one phoneme less 
than Southern British English, since the opposition Sam/psalm is lost. 
On the other hand, this deficiency in number might be made up by the 
regular opposition of the first elements of such a pair as witch/which, 
which would establish a phonemic contrast between /w/ and /m/. 

It should not, however, be assumed that the phonemic systems of two 
dialects are different merely because the phonetic realizations of the 
phonemes are dissimilar. The sound sequence [set], 1.e. with a vowel in 
the region of cardinal 3, may bea realization of satin one dialect and of 
set in another; the phonemic categories commonly represented as /1:, 1, 
e, ee, a:/, etc., may nevertheless be present in both dialects, the vowel 
system in the first dialect being somewhat more ‘closed’ than that of the 
second. Or again, the diphthong [au] is a realization of the phoneme of 
boat in educated Southern British, but is frequently a realization of the 


INote that ‘dialect’ includes pronunciation, grammar and vocabulary; ‘accent’ is 
confined to pronunciation. 
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vowel in boot ina type of London popular speech; the same number of 
vowel phonemes occurs in both kinds of English. Moreover, two 
speakers of the same dialect may distribute the same number of 
phonemes differently among the words they use. In Southern British 
English, some will say off, cloth, cross with /o:/, others with /p/; some 
will say elastic with /z/ in the second syllable, others /a:/; some will 
say /'ju: nizn/ for unison, others /'ju: nisn/, etc. Ideally, phonemic 
analysis should be applied to the speech of one person (to one /diolect). 
Individuals are, however, inconsistent; in certain situations, they may 
change the number of their phonemes, e.g. the occasional use of /m/; 
they may not always use the same phoneme in a particular word or 
group of words, e.g. the erratic use, in the same person’s speech, of /D/, 
or /d:/ in words of the off group. 

To sum up, we may conclude that a phonemic analysis of a number 
of varieties of one language is likely to reveal: different coexistent 
phonemic systems; considerable phonetic discrepancies in the realiza- 
tions of the phonemes of systems which have an equal number of 
phonemic categories; variation in the distribution of phonemes in 
words, even within a community using the same phonemic system; and 
variation of phoneme distribution, even within the speech of one 
individual, according to the situation. It is important to remember this 
likelihood of complication in both the system and its realization, not 
only in the present situation but also when it is a question of investi- 
gating past states of the language. 


5.3.7 The Word as Basis for Phonemic Analysis 


It frequently happens that a phonemic analysis is based on a unit not 
larger than the word. If any larger section of the utterance is used, the 
analysis becomes a great deal more complicated. As Daniel Jones has 
pointed out,! the two phrases plum pie and plump eye exemplify the 
sort of complication which will arise. The two phrases differ principally 
in the aspiration which accompanies the /p/ of pie but which is absent 
from the final /p/ of plump. A phonemic analysis which is based on 
such an extensive sound sequence would require the establishment of 
two /p/ phonemes, one with, one without aspiration. The difficulty is 
avoided if the word is treated as a complex phonetic and phonemic 
entity, special consideration being given to word boundaries in the 
utterance. If, however, the linguistic analysis is based on a sequence 
more extensive than the word, a mark of contrast has to be established 
in order to deal with the behaviour of phoneme sequences at word or 
morpheme boundaries (see ‘Juncture’, Chapter 11, §11.3.8). 


¡Daniel Jones, ‘Some Thoughts on the Phoneme,’ Transactions of the Philological 
Society, 1944. 
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5.4 The Syllable 


The concept of a unit at a higher level than that of the phoneme or 
sound segment, yet distinct from that of the word or morpheme, has 
existed since ancient times. It is significant that most alphabets, such as 
our own, which have as their basis the representation of phonemes by 
letters (however approximately), have reached this state by way of a 
form of writing which symbolized a group of sounds—a syllabary. 
Indeed, the basis of the writing of many languages, e.g. that of the 
Semitic group, remains syllabic in this sense. The notion that there 
exists at this higher level a unit known as the syllable has led to many 
attempts in recent times to define the term. These attempts fall into two 
main categories: those which seek a universal definition in phonetic 
terms and those which look for a specific functional definition in terms 
of a particular language. In the following sections, some of the many 
approaches are outlined. 


5.4.1 The Phonetic Approach 


(1) The Prominence Theory.—In any utterance some sounds are said 
to be more ‘prominent’ or ‘sonorous’ than others, 1.e. they are felt by 
listeners to stand out from their neighbours. In the word sonority 
/sa'noraty, such ‘peaks’ of prominence are carried by /a, b, 2, Y. The 
number of syllables being determined by the number of peaks of promi- 
nence, there are in this case four syllables. Syllable boundaries occur at 
the points of relatively weak prominence (‘valleys’), i.e. at /s, n, T, t/. 
This theory, which is based mainly on auditory judgments, does not 
determine to which syllable the weak sound, constituting the boundary 
between two syllables, is to be attributed. Moreover, difficulties are 
encountered in the case of languages such as English which permit con- 
sonant clusters. The word extra, for instance, transcribed as /'ekstro/, 
would show by the same procedure three peaks of prominence on 
/e,s, 3/, thus forming three syllables. Such a solution is at variance 
with the native feeling for a division at a higher level than the phoneme, 
which tends to be interpreted as /ek-stra/ or /eks-tro. 

(2) The Pulse Theory.—A more fundamental approach to the 
syllable is concerned with the muscular activity controlling lung move- 
ment which takes place during speech and which is capable of being 
investigated by experimental methods. It is claimed that in any utter- 
ance there are a number of chest pulses, accompanied by increases in air 
pressure, which determine the number of syllables uttered. Such a 
theory suggests that the syllable rather than the sound Is the basic unit 
of speech, consonantal sounds acting typically at the onset (releasing 
factor) and closure (arresting factor) of the syllable, while vowel sounds 
are nuclear to the syllable and render the chest pulse audible. Again, 
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such a unit on the speech level may prove to be irrelevant or misleading 
on the linguistic level. Particularly when it is a question of the juxta- 
position of two vowel sounds, the second being weakly stressed as in the 
case of seeing /'siz1n/, it is doubtful whether a double chest pulse will be 
evident, although it is clear that the word is to be divided linguistically 
into two units. 


5.4.2 The Linguistic Approach 


A unit of greater magnitude than the phoneme or sound segment is 
more usefully defined in linguistic terms, i.e. with reference to the 
structure of one particular language rather than in general, phonetic 
terms with universal application. It may be found appropriate to divide 
a similar sound sequence differently in different languages; e.g. such an 
utterance as [nga:] might be found to consist of three units in one 
language -/n-ga-a/, two in another /pn-9ga:/, and even one in another 
-/nga: /. The divisions to be made might be indicated by the extent of 
the sequence covered by different tones or stress, in which case the 
syllable would correspond to a unit of tone or stress. Alternatively, a 
statement concerning the syllables of a language may refer to the 
particular way in which phonemes are found to combine in that 
language (the earliest meaning of the term), so that certain phonemes 
are shown to have a typically central situation in the permissible groups, 
whereas others are more typically marginal in their position. Such a 
procedure would not, however, decide in the case of the English word 
/'ekstro/ the point at which the syllabic division is to be made, since /k/ 
is a possible final phoneme and /str/ a permissible initial cluster, and 
yet /ks/ is also possible finally and /tr/ initially; both solutions would 
be valid in the light of the English distributional system. (See $$9.8, 9.9 
for details of the combinatory possibilities of English phonemes in the 
word.) 

A term need not, of course, be defined simply because 1t has existed 
for two thousand years. If experimental procedures provide evidence of 
a unit above the sound segment at the physiological or acoustic levels of 
speech, or if, at the linguistic, structural level, a unit consisting of 
groups of phonemes is found to be useful, then the term ‘syllable’ may 
well be applied to such a unit. It is doubtful, however, whether the term 
can be applied to a unit both of speech and language, any more than 
‘sound’ can cover, without ambiguity, both a phonetic segment and a 
linguistic, constrastive unit. 


5.5 Vowel and Consonant 


It was seen in the previous chapter that attempts to arrive at a universal 
phonetic definition of the two terms, from an articulatory or an 
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auditory standpoint, encountered difficulties as regards certain border- 
line sounds as [I, r, j, w] in English. If, however, the syllable is defined 
linguistically, i.e. from the point of view of distribution of phonemes, a 
solution can be given to most of these problems. It will be found that the 
phonemes of a language usually fall into two classes, those which are 
typically central (or nuclear) in the syllable and those which are non- 
central (or marginal). The term ‘vowel’ can then be applied to those 
phonemes having the former function and ‘consonant’ to those having 
the latter. The frictionless English sounds /}, w, r/, for instance, which 
according to most phonetic descriptions are of a vowel-like (vocoid) 
type, function in the language as consonants, i.e. are marginal in the 
syllable. The English voiced lateral and nasal sounds are commonly 
classed phonetically as of the consonantal (contoid) type because of the 
complete or partial mouth closure with which they are articulated. 
From a functional viewpoint, too, they behave as consonants, since 
they are usually marginal in the syllable. Nevertheless, they sometimes 
seem to fulfil a syllabic function without the presence of a vowel, e.g. in 
middle, button ['midl, batn]. These /l/ and /n/ phonemes can be said 
to be syllabic here in terms of the prominence and pulse theories; they 
also function as separate units in the intonation structure of English. 
Yet there is some difficulty in establishing their syllabic nature in 
distributional terms, since it is possible to compare such word endings 
as /-dz/ and /-ts/, in which /Z/ and /s/ are not usually considered to 
have syllabic function. This difficulty is sometimes avoided by equating 
[l, n] with /al, an/ both on logical grounds and also because / 'midal/, 
for instance, is a possible pronunciation of middle. However, if the 
equation with /a/ + /l, n/ is not admitted, the cases of consonant + 
/s/ or /z/ are not strictly parallel even on distributional grounds. Both 
/1/ and /n/ pattern in such a final situation very much as vowels in that 
they may follow the great majority of consonants; /s/ and /z/, on the 
other hand, have a much more restricted occurrence in this position and 
are, therefore, less vowel-like. 

A further illustration of the consonantal function of /j/ and /w/ is 
provided by the behaviour of the English articles when they combine 
with words beginning with these phonemes. The is pronounced /ó1: / or 
/01/ before a vowel and /03/ before a consonant, whenever it precedes 
the other English phonemes; we also have the forms a or an an accord- 
ing to whether a consonant or vowel follows. Since it is normal to 
pronounce the yacht, the watch with /6a/ and to prefix a to yacht and 
watch rather than an, /j/ and /w/ may be said to behave as if they 
belong to the consonant class of phonemes, despite their vocalic 
quality. 

It is clear that if the elements of the utterance are divided in two 
categories, some units which are assigned to one class according to 
speech (phonetic) criteria may fall into the other class when it is a 
question of linguistic (functional) analysis. 
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5.6 Notation 


The symbolization of an utterance (analysed in terms of a linear 
sequence of sounds) will naturally differ according to whether the aim 
is to indicate detailed sound values—an allophonic (or ‘narrow’) 
transcription—or the sequence of significant functional elements—a 
phonemic (or ‘broad’) transcription. 

(1) In the former, allophonic, type of notation, an attempt is made 
to include a considerable amount of information concerning our 
knowledge of articulatory activity or our auditory perception of allo- 
phonic features. The alphabet of the International Phonetic Associa- 
tion provides numerous diacritics for a purpose such as this; e.g. the 
word titles might have its phonetic features transcribed as ['t*ä: étttz]. 
Such a notation would show the affrication and aspiration of the initial 
[t], the fact that the first element of the diphthong is retracted from 
cardinal 4 and is long compared with the second element, which is a 
retracted cardinal 2 vowel, that the [t] has a back vowel resonance and 1s 
partly devoiced in its first stage, and that the final [z] is completely 
devoiced. Such a notation is relatively explicit and detailed, but gives no 
more than an impression of the complexity of the utterance as revealed 
by the various methods of physiological and acoustic investigation. 
Nevertheless, an allophonic transcription of this kind can be useful for 
comparative purposes, e.g. for the comparison of different types of 
pronunciation, and is a necessary first step in the notation of a language 
for which no phonological system has been worked out. 

(2) In the second type of notation—the phonemic, symbols are allo- 
cated only to the strictly contrastive sound categories of the language; 
a phonemic transcription is highly sophisticated, since a number of 
conventions are implicit in each symbol. The word titles, for instance, 
would be written /'taitlz/, it being assumed that the phonemes occur- 
ring in such a context would have the predictable phonetic realizations 
noted in the preceding paragraph. The number and type of symbols 
used will, of course, depend upon the kind of phonemic solution which 
has been arrived at for the language. As far as English is concerned, 
and notably the English vowel system, several phonemic solutions 
are current and a corresponding number of notation systems exist, 
all of which are justifiable, although the choice of symbols and the con- 
ventions attached to them differ.! The striking discrepancies to 
be found among present-day phonemic transcriptions of British 
English are often due to the varying significance which is attached to 
vowel quantity and to the degree to which the symbols are made to coin- 
cide with the ordinary Latin alphabet. Thus, in the various phonemic 


l! For a discussion of the problems of phonetic transcription, see Daniel Jones, Outline 
of English Phonetics, 1956, Appendix A. 
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transcriptions of Southern British English, the vowels of cotand caught 
are symbolized as /D, 9/ or /D, 9:/ or /2, 95/ or /o, 0:/ or /o, 00/, 
the last three solutions emphasizing the quantitative contrast (see 
Chapter 7 for a discussion of such solutions). 

A phonemic transcription, with its-implicit conventions, is the most 
common and least cumbersome method of indicating unambiguously 
the spoken realization of a language. It may happen that it has a form 
which is close to that of the language’s orthography, e.g. Spanish or 
Serbo-Croat; it is particularly useful as a corrective instrument in the 
case of a language such as English where the orthography fails to mirror 
consistently the present-day pronunciation. A phonemic notation may 
also be used as a basis for teaching the pronunciation of a foreign 
language, the allophonic conventions to be related to a phonemic 
symbol being easily learnt. Sometimes, however, it may be helpful, at 
least in the early stages, to include symbols representing allophones in 
order to emphasize a particular feature of the language, e.g. in teaching 
English, it is often necessary to insist upon the variety of [l] sounds by 
using not only [l] but also [1]. 


5.7 Prosodic Features 


We have so far dealt almost entirely with the description and organiza- 
tion of the qualitative features of an utterance. As we have seen in 
Chapters 2 and 3, a sound has not only quality, whose phonetic nature 
can be described and function in the language determined, but also 
length, pitch, and a degree of stress. Such features may extend in time 
beyond the limits of the phoneme and embrace much higher units of the 
utterance. Indeed, in the case of the pitch variation characteristic of an 
intonation pattern, analysis may involve the whole of a lengthy utter- 
ance. In this sense, such features are prosodic, or supra-segmental. 
All three features may be measured physiologically or acoustically: 
length, as duration; pitch, as the frequency of the fundamental; and 
stress, as a measure of intensity, muscular activity, or alr-pressure. 
Once again, as with qualities, the absolute measurements are less useful 
than an assessment of relative function: lt is, for instance, irrelevant to 
attempt to measure the precise duration of a sound, unless the language 
in question makes some use of length variation as a distinctive feature. 
If, in a language, there appears to be a significant opposition between a 
long and short [a], then it may be worth determining what the distinc- 
tion is in absolute terms; or, if the length of sounds appears to vary con- 
siderably according to the context in which they occur, it is worth estab- 
lishing the facts by measurement. But, in a phonemic notation, it is not 
normally necessary to take into account more than two (or sometimes 
three) significant degrees of length. Similarly, the absolute pitches used 
in speech are less important linguistically than the pitch relationships 
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which a native speaker distinguishes as significant. And again, stress, 
which in the strict sense the listener hears as loudness, rarely requires 
more than two or three degrees to be shown, although the speaker’s 
intensity will show an infinite number of degrees of variation. Thus, as 
with qualities, a preliminary transcription may indicate as many shades 
of duration, pitch, and loudness as possible, but the ultimate notation 
will present functional categories, abstracted from the concrete expres- 
sion. Pitch will be described linguistically in terms of an intonation 
or tone system and duration in terms of contrastive oppositions of 
quantity; as far as stress is concerned, it will have to be established 
whether it has a linguistic reality independent of the other three features 
of quality, quantity and pitch, and, if it has, whether it is necessary to 
take into account more degrees than its simple presence or absence (see 
Part 111 for some discussion of the prosodic features of English). 


5.8 Paralinguistic Features 


In addition to the prosodic features concerned with pitch, rhythm, 
tempo, loudness, etc., there are other characteristics of the utterance 
which are involved in the speech communication process and which are 
categorized as paralinguistic. In the first place, an identification of the 
speaker relies largely on our recognition of his own peculiar voice 
quality (or ‘voice set’) which itself is the result of the anatomical 
structure of his vocal organs and of his articulatory habits. Secondly, he 
may make special modifications of his voice quality for a particular 
effect: e.g., he may use a whisper or a breathy or creaky voice or 
falsetto; or again, he may be stimulated to accompany his utterance 
with a laughing or sobbing or whining feature. This second category 
differs from the first in that the features involved are ones which are 
universally used in his own language (and in most others) as signals 
denoting particular mental or emotional states and which are therefore 
capable of classification (to the extent that such mental or emotional 
states can be categorized). ! 


¡See entries in bibliography under Crystal, Quirk, Catford, Trager, Laver. 
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The Historical Background 


6.1 Phonetic Studies in England 


Although linguistic science has made rapid and spectacular progress in 
the present century, 1t is not merely in modern times that speech and 
language have beer the object of serious study. Extensive accounts of 
the pronunciation of Greek and Latin were written two thousand years 
ago and, in India, at about the same time, there appeared detailed 
phonological analyses of Sanskrit, which reveal remarkable affinities 
with modern ways of thought—‘These early phoneticians speak in fact 
to the twentieth century rather than to the Middle Ages or even to the 
mid-nineteenth century . . .** In this country, too, printed works con- 
talning information of a phonetic kind extend back for at least four 
hundred years. It is true that the very earliest writers in England rarely 
had as their main interest a purely phonetic investigation; and the 
descriptive accounts which they provided are less rigorous and satis- 
factory, by modern standards, than those of the Indian grammarians. 
But, by the seventeenth century, we find a considerable body of 
published work, which is already entirely phonetic in character and 
which contains observations and theories still adhered to to-day. 
6.1.1 Some of the first writers whose work we possess were con- 
cerned with the relation between the sounds of English and those of 
another language. Thus, John Palsgrave’s French grammar Lesclar- 
cissement de la Langue Francoyse (1530) includes a section which deals 
with the pronunciation of French, much as any modern grammar 
would. In order to explain the values of the French sounds, Palsgrave 
compares them with the English. This is done in no objective fashion, 
and it is not easy for us now to know what precise sound is indicated in 
either language. But this difficulty of communicating sound values in 
print—especially those of vowels—is one which will be shared by all 
writers until some system of objective evaluation, such as that of the 
Cardinal Vowels, is devised. At least, John Palsgrave was sufficiently 
aware of divergent associations of letters and sound to provide some 


IW. S. Allen, Phonetics in Ancient India, Oxford University Press, 1953, p. 7. 
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passages of French in a kind of phonetic transcription. Another early 
writer concerned with pronunciation is William Salesbury, a 
Welshman, whose Dictionary in Englyshe and Welshe (1547) contained 
comments on the sounds of English. Sound values are indicated by 
means of a method of transliteration in Welsh or English. Indeed, 
though grammars of foreign languages published during the next three 
centuries will increasingly attempt more exact description and com- 
parison of sounds, for the great majority of them the section devoted to 
pronunciation will rely mainly on transliteration for indicating 
approximate values. Even to-day, grammars of foreign languages 
frequently make use of this approximate method of ‘simulated’ 
pronunciation. 

6.1.2 Amore important type of phonetic inquiry stemmed from the 
activities of those who, particularly in the sixteenth and seventeenth 
centuries, were concerned at the increasing inconsistency of the rela- 
tionship of Latin letters and the sounds which they represented, 
especially in English. There had been during the previous five or six 
centuries great changes of pronunciation, particularly as far as the 
vowel sounds were concerned, so that letters no longer had their 
original Latin values. The same sound could be written in a number of 
ways, or the same spelling do service for several sounds; moreover, the 
same word might be spelt in different ways by different writers. Thus, 
four hundred years before the activities of Bernard Shaw and the 
Simplified Spelling Society of to-day, men were aware of the need to 
bring some order into English spelling. During the four centuries that 
have elapsed since these early efforts, our pronunciation has continued 
to evolve without any radical changes of spelling having been made, 
with the result that to-day discrepancies between sound and spelling are 
greater than they have ever been. It can, however, be said that for more 
than two hundred years our spelling forms, inconsistent though they 
may be as far as sound symbolization is concerned, have been 
standardized. 

The.early spelling reformers were obliged, if they were to propose a 
more logical relationship of sound and spelling, to investigate the 
sounds of English. A writer such as Thomas Smith, De recta et emen- 
data linguae anglicae scriptione (1568), makes many pertinent phonetic 
comments on such matters as the aspiration of English plosives and the 
syllabic nature of /n/ and /l/, as well as providing correct descriptions 
of the articulation of consonants. Yet, he, as a phonetician, is over- 
shadowed in the sixteenth century by John Hart, whose most important 
work, the Orthographie, was published in 1569. Besides making out his 
case for spelling reform and proposing a revised system, Hart describes 
the organs of speech, defines vowels and consonants (distinguishing 
between front and back vowels and between voiced and voiceless con- 
sonants), and notes the aspiration of voiceless plosives. Of the 
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numerous seventeenth-century orthoepists, only Alexander Gil, 
Logonomia Anglica (1619, 1621), can be compared with Hart on the 
phonetic level, though even his observations lack the objectivity of 
Hart’s. 

6.1.3 If the writers mentioned above used phonetic methods of 
analysis and transcription as a means to their end of devising an 
improved spelling, there emerged in the seventeenth century a group of 
writers who were interested in speech and language for their own sake. 
Because of their preoccupation with detailed analysis of speech activity; 
the comparative study of the sounds of various languages, the classi- 
fication of sound types, and the establishment of systematic relation- 
ships between the English sounds, they can be said to be the true pre- 
cursors of modern scientific phoneticians. Two of the most celebrated, 
John Wallis and Bishop Wilkins, were among the founders of the Royal 
Society; and, indeed, Isaac Newton, the greatest of the early members 
of the Society, was interested in phonetic analysis and has left notes of 
his own linguistic observations. Language was considered a proper 
object of the attention of the writers of this new scientific age, their view 
of speech and pronunciation being set against a framework of the 
universal nature and characteristics of language. 

The linguistic fame of John Wallis, primarily a mathematician, 
spread throughout Europe and lasted into the eighteenth century, his 
work being copied long after his death. His principal linguistic work, 
Grammatica Linguae Anglicanae, was first published in 1653 and the 
last authoritative edition appeared in 1699; but other, unauthorized, 
editions continued to appear in the eighteenth century, the last being 
dated 1765. Wallis intended his Grammar to help foreigners to learn 
English more easily and also to enable Englishmen to understand more 
thoroughly the true nature of their language. He admits in his preface 
that he is not the first to undertake such a task, but claims that he does 
not seek to fit English into a Latin mould, as most of his predecessors 
had done, but rather to examine the sounds of English as constituting a 
system in their own right. By his methods, he says, he has succeeded in 
teaching not only foreigners to pronounce English correctly but also the 
deaf and dumb to speak. The introductory part of the work (Tractatus 
de Loquela), besides giving a short history of English, describes in 
detail the organs of speech and attempts to establish a general system of 
sound classification which will do service for all languages (illustrations 
of qualities are taken from French, Welsh, German, Greek, and 
Hebrew as well as English). Vowels are classified in Guttural, Palatal, 
and Labial categories, subdivided into Wide, Medium, and Narrow 
classes. The degrees of aperture are similar to those which are used even 
to-day, but the divisions Guttural, Palatal, and Labial, which take into 
account both the area of raising of the tongue and also lip action, show 
a confusion of dimensions not to be found in more modern analyses. 
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Consonants, like vowels, are divided into three classes: Labial, Palatal, 
and Guttural, being different from vowels in that the air-stream from 
the lungs is obstructed or constricted at some point. Wallis remarks that 
the air-stream may pass entirely through the mouth, almost completely 
through the nose, or almost equally divided between the mouth and the 
nose, the position of the uvula determining the difference of direction. 
Thus, nine basic consonantal articulations are postulated. In addition, 
the air-stream may be completely shut off (Closed or Primitive) or 
merely constricted (Open, Derivative, or Aspirate), the latter being arti- 
culated with a narrow aperture or with a wider, rounder, opening. The 
‘closed’ consonants (stops) consist of the mutes [p, t, k],! the semi- 
mutes [b, d, g], and the semi-vowels [m, n, n]. The corresponding 
‘open’ or ‘aspirated’ consonants are: mute [f, s, x], semi-mutes 
[v, z, x] or [y]; and those with a wider opening: mute [f] (again), [6, h], 
semi-mutes [w, ð, j]; [l, r] are related to the [d] or [n] articulations; 
[f, 3] are regarded as compound sounds. Wallis’s detailed remarks on 
the pronunciation of English are made in terms of this general system 
stated in the Loquela. It will be seen that such a classification, despite 
errors and inadequacies which are apparent to-day, represents a serious 
attempt at the establishment of universal sound categories. Although 
the elements of Wallis’s system have been quoted briefly here, it should 
be pointed out that his is merely the most celebrated of a number of 
similar analyses made at about the same time. 

His fellow member of the Royal Society, Bishop John Wilkins, 
published in 1668 an Essay towards a Real Character and a Philo- 
sophical Language. Written in English, this work of 454 pages, with a 
dictionary appended, is of much wider scope than that of Wallis, since 
it aims at no less than the creation of a universal language, expressed by 
means of ‘marks, which should signifie things, and not words’. Wilkins 
acknowledges his debt to his contemporary linguists, especially in 
respect of the account of pronunciation which forms a comparatively 
small part of the Essay. Wallis, he says, ‘seems to me, with greatest 
Accurateness and subtlety to have considered the Philosophy of 
Articulate sounds.’ Wilkins, too, describes the functions of the speech 
organs and gives a general classification of the sounds articulated by 
them; his treatment of consonants is in fact more satisfactory than that 
of Wallis. He claims that the thirty-four letters which he proposes for 
his alphabet are sufficient ‘to express all those articulate sounds which 
are commonly known and used in these parts of the world’. In his 


1 Wallis’s own symbols are here replaced by IPA equivalents, but it is not always clear 
from Wallis’s description which sounds are intended: thus, his description of ch and gh 
would seem to indicate [x] for both, though in the system one would expect gh to mean 
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account of the values of the letters, reference is made not only to Euro- 
pean sound systems but also to such little known languages as Mexican, 
Armenian, Arabic, Japanese, and Chinese. 

Any account of seventeenth-century phoneticians should include the 
name of Christopher Cooper. Though he did not achieve the great 
European reputation of Wallis, he is considered by many to be the 
greatest English phonetician of the century. His work on English pro- 
nunciation was first published in 1685 (Grammatica Linguae Angli- 
canae), with an English edition appearing in 1687 (The English 
Teacher, or The Discovery of the Art of Teaching and Learning the 
English Tongue). A schoolmaster rather than a member of the Royal 
Society, Cooper was less concerned than many of his contemporaries 
with the establishment of universal systems. His aim was to describe 
and give rules for the pronunciation of English for ‘Gentlemen, Ladies, 
Merchants, Tradesmen, Schools and Strangers’, rather than to devise a 
logical system into which the sounds of English and other languages 
might be fitted. Moreover, he deals with the spelling of English as it 
exists and does not seek to reform it. The first section of his book is con- 
cerned with the description of speech sounds (‘The Principles of 
Speech’) and the second part gives rules for the relation of spelling and 
pronunciation in different contexts. Ccoper describes the organs of 
speech and names those sections of the upper speech tract which are 
mainly responsible for the articulation of the ‘breath’: ‘guttural, 
lingual, palatine, dental, labial, lingua-palatine, lingua-dental.? Those 
sounds in the production of which the air-stream 1s ‘straitned or inter- 
cepted’ are consonants (classified as semi-vowels, aspirated, semi- 
mutes, and mutes), while those in which the air-stream is ‘freely emitted 
through the nostrils or the lips’ are vowels. He notes that voice, ‘made 
by a tremulous concussion of the larynx,’ is a characteristic of vowels, 
semi-vowels, and semi-mutes. His classification of vowels is in terms of 
lingual, labial, and guttural categories, a somewhat confusing distinc- 
tion being made between the English long and short vowels. Diphthongs 
are defined properly as ‘a joyning of two vowels in the same syllable, 
wherein the power of both is kept’. His consonantal classification, with 
IPA equivalents here, shows: labial" sounds, subdivided into semi- 
vowels [w, v, m], aspirated [m, f, m}, semi-mute [b], mute [p]; lingual 
sounds, subdivided into semi-vowels [z, 3, 0, n, l, r, j], aspirated 
[s, $, 0, n, l, r, ç], semi-mute [d], mute [t]; guttural sounds, subdivided 
into semi-vowels [y, y, h], aspirated [x, $, h], semi-mute [g], mute [k]. 
The second part of the work, dealing with the pronunciation of various 
English spelling forms, provides more specific information about the 
pronunciation of English than is to be found in the work of any other 
writer in this period. Numerous examples are given, e.g. more than 
three hundred cases of the -tion suffix pronounced with [f]; words are 
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listed which have either the same pronunciation with different spellings 
or the same spellings with different pronunciation; and rules are given 
for the accentuation of words. 

It will be seen from the mention of these few names, chosen from the 
many who were writing on matters of pronunciation in the seventeenth 
century and omitting those who were composing spelling books and 
grammars for foreigners, compiling lists of homophones and devising 
systems of shorthand, that there was at this time a surge of scientific 
and analytical interest in speech and pronunciation such as was not to 
be repeated until the nineteenth century. It is true that the judgments 
made were largely intuitive, but this was to remain the case in phonetic 
research until the second half of the nineteenth century. In their theore- 
tical approach, however, many of these early writers show a preoccupa- 
tion with classification, systematization, and problems of distribution 
which is paralleled in the activities of modern linguists. 

6.1.4 The spirit of general scientific inquiry into speech which 
characterized a large proportion of the phonetic work of the seven- 
teenth century had, by the eighteenth century, lost much of its original 
enthusiasm. Prescriptive grammars containing rules for pronunciation 
continued to be produced in large numbers and provide us with 
information concerning the contemporary forms of pronunciation; 
shorthand systems, too, which show an undiminished popularity, 
necessitated the analysis of English into its constituent sounds. Yet the 
main achievement of the century lies in its successful attempt to fix the 
spelling and pronunciation of the language. Dictionaries had been 
published in the seventeenth century, but the works having the main 
stabilizing and standardizing influence on the language were to be the 
Dictionaries of Samuel Johnson (1755), Thomas Sheridan (1780), and 
John Walker (1791), the last two writers being particularly concerned 
with the standardization of pronunciation. John Walker, whose 
dictionary is called by the Dictionary of National Biography ‘the statute 
book of English orthoepy’, exerted a great influence on the teaching of 
English not only in this country but also in America. Moreover, he pays 
considerable attention in his work to the analysis of intonation, treated 
but perfunctorily by most earlier writers. In this respect, however, he 
owes much to Joshua Steele’s remarkable Prosodia Rationalis 
(1775—79), in which there is presented a system of notation capable of 
expressing pitch changes, stress, and rate of delivery. (Steele is cele- 
brated for his detailed analysis of a soliloquy delivered by David 
Garrick.) 

We have, in fact, been dealing up to now with two types of work on 
pronunciation, which, especially in the eighteenth century, came to be 
confused: on the one hand, and in the minority, the books which laid 
emphasis on description, analysis, and classification; on the other, the 
books which were mainly normative and continue the tradition of 
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‘rhetoric’. That part of rhetoric known as ‘elocution’ originally 
referred to the style and form of speech, ‘the garnishing of Speech,’ but 
in the eighteenth century the term was increasingly applied to the 
method of delivery. It is not until the nineteenth century that a clear 
distinction will be made between the aesthetic judgments upon which 
elocution largely relies and the objective descriptive statements which 
form the basis of phonetic analysis. Until such a distinction is explicitly 
made, advances in phonetic techniques have to be disentangled from a 
mass of irrelevant opinion. 

6.1.5 Inthe nineteenth century the English traditional preoccupa- 
tion with phonetic notation and the simplification of English spelling 
continues. Isaac Pitman (1813—97), whose system of shorthand is so 
widely used to-day, and Alexander J. Ellis (1814-90), concerned at the 
difficulties which our spelling presented to English children as well as to 
foreigners, devised an alphabet (Phonotype, 1847) which conformed to 
a phonetic analysis of English and yet remained based upon the Latin 
characters. They were supported by the Phonographic Society and 
published a journal which eventually (1848) was named the Fonetic 
Jurnal. Ellis, however, developed other types of alphabet, notably 
Glossic, which is essentially an adaptation of traditional spelling, and 
Palaeotype, which used conventional letter shapes but in a great variety 
of type, so that fine shades of sound could be symbolized. This latter 
alphabet was put to good use by Ellis in his historical and dialectal 
studies; but not only is the precise value to be attached to a letter not 
always easily determined—because of the method of reference to 
sounds in languages—but also the complexity of the system renders it 
difficult for the reader to assimilate. Ellis’s work on notation, however, 
largely inspired the ‘Broad’ and ‘Narrow’ Romic transcriptions of the 
great Henry Sweet (1845-1912). In 1867 Alexander Melville Bell, 
father of Alexander Graham Bell (an inventor of the telephone), 
published his book Visible Speech, while a lecturer on speech in the 
University of London. This remarkable work set out to classify all the 
sounds capable of being articulated by the human speech organs and to 
allot a systematic and related series of symbols to the sounds. The 
unfamiliarity of the invented symbol Shapes was no doubt responsible 
for the fact that this means of notation has never been widely used in 
purely phonetic work, but its value was for many years demonstrated, 
especially in America, as a System applied to the teaching of the deaf. 

Although, in referring to these writers, emphasis has been laid on 
their contribution to the development of phonetic transcription, their 
published work covers every aspect of speech activity. Bell’s interests, 
in his forty-nine publications, lay mainly in the field of elocution and 
the description of articulatory processes. But Ellis and Sweet applied 
the techniques of phonetic analysis both to the description of con- 
temporary pronunciation and also to the whole field of historical 
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phonological investigation. Ellis, in fact, will be chiefly remembered 
for his massive work On Early English Pronunciation, published in five 
volumes between 1869 and 1889. In these volumes Ellis traces the 
history of English pronunciation and, at the same time, contributes 
descriptive phonetic studies of contemporary dialects. It 1s not sur- 
prising that a work of such enormous scope should since have been 
found to be inadequate in many respects, but it cannot be denied that 
Ellis was a great pioneer in the application of objective techniques to the 
description of past and present states of the language. Although his 
assessment of the value of many grammarians from the sixteenth 
century onwards was often faulty, he initiated a study of their work 
which has continued unabated to this day. Henry Sweet, a greater 
phonologist and scholar, applied stringent phonetic techniques to all 
his work, so that,.whether it be a question of phonetic theory or the 
history of English or the description of a language such as Welsh or 
Danish, his basic approach and the majority of his conclusions remain 
valid to-day. He belongs as much to the twentieth century as to the nine- 
teenth and his influence is clearly to be seen in the work of Daniel Jones, 
who has dominated British English phonetics of this century. 

This brief and selective outline goes some way towards revealing a 
line of phonetic inquiry which has been continuous in England from the 
sixteenth century to the present day. The techniques for describing 
speech and language have become progressively more objective, 
modern instrumental methods for physiological and physical investiga- 
tion providing the latest stage in the process. A problem confronting 
linguists of to-day concerns the correlation of concrete data, which is 
being accumulated in great detail by modern instrumental analysis, 
with abstract linguistic realities, many of which have for centuries been 
implicit in the work of writers on language. 


6.2 Sound Change 


The language spoken in England has undergone very striking changes 
during the last thousand years, changes which have affected every 
aspect of the language, its morphology, syntax, and vocabulary as well 
as its pronunciation. Old English! is so different from present-day 
English from every point of view that it is unintelligible to the modern 
Englishman either in its written form or in a reconstructed spoken 
form; Chaucer’s poetry presents difficulties in print and, when read in 
what 1s presumed to be the pronunciation of the fourteenth century, 


¡The following abbreviations will henceforward be used: OE—Old English (up to 
approximately a.D. 1100); ME—Middle English (approx. 1100-1450); eModE—early 
Modern English (approx. 1450-1600); PresE—present-day English; AN— Anglo- 
Norman; OF— Old French. 
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offers a sound pattern which it is not easy for the modern listener to 
interpret; even Shakespeare, though phonetically not far removed from 
ourselves, raises problems of syntax and meaning. 

The pronunciation of a language seems to be subject to a continuous 
and inevitable process of change. Indeed, it would be surprising if a 
means of communication, handed on orally from one generation to 
another, showed no variation over the centuries. It is not difficult to 
find examples of changes which are taking place in our own times, e.g. 
the centralization of the first element of the diphthong in home appears 
to be a fairly recent development; the vowel in saw seems to be some- 
what closer, with stronger lip-rounding, amongst young people in the 
South of England to-day than it is in the pronunciation of older people. 
A change of a different kind—the use of another phoneme in a class of 
words—is illustrated by the case of words such as poor and sure; these 
tend to be said by the older generation with /ua/, whereas the new 
generation much more commonly uses /2:/. At any given moment, 
therefore, we must expect several pronunciations to be current, repre- 
senting at least the older, traditional, forms and the new tendencies. 

To-day there are a number of reasons why we might expect these pro- 
cesses of change to operate less rapidly. The fact that communication 
throughout the whole country is easy, the spread of universal education 
and the resultant consciousness of the printed word, the constant 
impact of broadcasting with its tacit imposition of a standard speech, 
these are all influences which are likely to apply a brake to change in 
pronunciation. They are, however, factors which have operated only in 
comparatively recent times. In former stages of the development of 
English, there was no mass, nation-wide influence likely to lead to 
stability and levelling. Printing, it is true, has been with us for four 
hundred years, but the wide dissemination of books, as of education, is 
a modern development. Indeed, as we have seen, the spelling of 
English, even in printed books, was not finally standardized until the 
eighteenth century. With such freedom from restraint, especially 
before the eighteenth century, we must expect the history of spoken 
English to give evidence of drastic change; but the changes which took 
place were not all of the same kind. ° 


6.2.1 Types of Change 


(1) The most important kind of change tends to affect the realization of 
a phoneme in all its occurrences. Such changes, not usually being set in 
motion by any immediate, outside influence, are in this sense inde- 
pendent; they are often known as ¡solative internal changes. Thus, the 
ME realization of the phoneme in a word such as house had the sound 
[uz], which has generally become [au] in modern English; similarly, the 
ME vowel phoneme having a value of the [a:] type, as in a word such as 
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name, is in most cases realized as a kind of [e1] in PresE. Changes of this 
type apply particularly to the English vowel system, which underwent a 
remarkable evolution of values, known as the Great Vowel Shift, 
during the centuries preceding the modern period. 

(2) Another kind of change which affects the realization of a pho- 
neme is that which is brought about by the occurrence of phonemes in 
particular contexts—a dependent change, often called internal com- 
binative. Thus, the nhoneme in mice, having now the sound [al], results 
from an earlier [1:] by means of an isolative change; but this [1:] sound 
in [mi:s] arose as a result of a combinative process of vowel harmony, 
or /-mutation, through the stages [mu:s1], [my:si] > [mi:s], where the 
change [u:] > [y:] can be explained by the fronting of [u:] under the 
influence of the [i:] of the following syllable. Such a combinative 
change belongs to OE, but a more recent change of this type is exempli- 
fied by words such as swan. This word was probably pronounced 
[swan] or [sween] in about 1600, but the [w] sound has rounded and 
retracted the vowel to give the modern form [swon]. The large majority 
of earlier [w] + [a] sequences have now given [w] + [pd], or [9:], by 
reason of this combinative change affecting this particular sound 
sequence, e.g. want, quality, war, water, etc. 

(3) Some changes are neither independent nor dependent upon the 
phonetic context; they may be said to be external to the main linc of 
evolution. Thus, it was fashionable in Elizabethan times to pronounce 
such words as servant and heard with [zr] or [ar], perhaps originally a 
dialect form, rather than with [er], the regular form of development; 
these words, with some exceptions such as clerk, have reverted to the 
normal development of ME [er],—[3:] rather than [a:]. It was also 
fashionable to pronounce the termination -ing as [in], only now 
retained as a special form of affectation or of vulgar speech. Such 
changes, involving a change of distribution of phonemes among word 
and morpheme classes, do not affect the phonemic system of the 
language. The introduction of foreign words may, however, at least 
temporarily and in the speech of a restricted number of individuals, 
disturb the number of phonemes or their distribution as regards 
position in the word. Thus, if the French word beige is used in English 
with the pronunciation /bel3/, we have a case of a final /3/ unknown in 
English words; or again, if restaurant is pronounced with any kind of 
nasalized vowel in the last syllable, the possibility of a new kind of 
vocalic opposition is introduced into the language. However, such 
foreign borrowings generally tend to conform to the English system: 
words with a final French /3/, such as prestige, camouflage, etc., may 
be realized in the English form with /d3/ and a word with a nasalized 
vowel like restaurant will be normalizcd to /‘restoron/, /'restoront/ or 
/'restrant/, etc. 

(4) In addition to changes of quality, there have also to be taken into 
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account changes involving quality and accentual pattern (stress). Thus, 
the vowel in such words as path, half, pass, still short three hundred 
years ago, is now long in the South of England. Or again, the vowels in 
good, book and breath, death, once long, are now relatively short. 
Changes of accent are particularly striking in the case of words which 
have come into the language from French: in ME, such words as village 
or necessary retained a stress accent accompanied by length on the 
penultimate syllable—/vi'la:d39/ and /,nese'sa:r19/. Now, the main 
stress accent has shifted to an earlier syllable in the English fashion, 
with the qualitative changes which weak accent involves—/ vilid3/, 
/‘nesasri/. Later borrowings, or those in less common use, often retain 
the French accentual pattern—thus, hotel or machine, with stress 
accent on the final syllable, whereas, 1f they had conformed to the 
English system, we might have had such modern forms as /'hautl/ and 
/'meetfin/ or /meitfim/, in the same way that the thoroughly anglicized 
form of garage gives / geerid3/. (See $6.3.4 on current changes.) 


6.2.2 Rate and Route of Vowel Change 


The English vowels have been subject to more striking changes than 
have the consonants. This is not surprising, for a consonantal articula- 
tion usually involves an approximation of organs which can be felt; 
such an articulation tends to be more stable in that it is more easily iden- 
tified and transmitted more exactly from one generation to another. 
Changes in the consonantal system comparatively rarely involve a 
modification of sound (an example of such a modification would be the 
affrication, for combinative reasons, of the OE palatal plosives [c, J] to 
[tf, d3] as in church < OE cirice and bridge < OE brycg). Far more 
common is the type of distributional change involving the conferment 
of phonemic status on an existing sound (e.g. [v, 6, z], allophones of 
/f, 8, s/ in OE, later obtain contrastive, phonemic, significance), or 
the disappearance of an allophone (e.g. postvocalic [x] and [¢] in such 
words as brought and right were largely lost inthe South of England by 
the seventeenth century), or the insertion of an existing phoneme in a 
particular class of words (e.g. the initial /h/ in words of French origin 
such as herb, homage). Whether it is a question of consonantal change, 
loss, or addition, it is usually possible to explain the type of modifica- 
tion which has taken place and the approximate period during which it 
occurred. 

A modification of vowel quality will, however, result from very slight 
changes of tongue or lip position and there may be a Series of imper- 
ceptible gradations before an appreciable quality change is evident (or 
is capable of being expressed by means of the Latin vowel letters). It is 
particularly difficult to assess rate and phonetic route of change in the 
case of those internal independent vowel changes which affect the 
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realization of a phoneme throughout the language. It is known, for 
instance, that the modern homophones meet and meat had in ME dif- 
ferent vowel forms, approximately of the value [e:] and [e:]. The [e:] 
vowel of meet became [1:] by about 1500 and it might be postulated that 
by a process of gradual change the [e: ] of meat first closed to [e:] and 
then, by the eighteenth century, coalesced with the [1:] in meet. The 
available evidence, however, suggests that the change [e: > 1:] may not 
have been either simple or gradual, but that two pronunciations existed 
side by side for along period (the conservative [e: ] beside another form 
[1:] which had resulted from an early coalescence with the meet vowel) 
and that it is the [1:] form which has in most cases survived. In other 
vowel changes, 1t may be agreed that the change was gradual, but 1t 1s 
difficult to date precisely the stages of development. Thus, the modern 
/ai/ of time results from a ME [i:] value; it is clear that the change has 
been one of progressive, widening, diphthongization, but there may 
have been a period of incipient diphthongization when there was hesita- 
tion between the pure vowel [1:] and some such diphthong as [11] or [91]. 
It is well to remember, therefore, that at any particular time in history 
there are likely to be a number of different, coexistent, realizations of 
vowel phonemes, not only as between regions but also between 
generations and social groups. An example of such variety in modern 
English is provided by the diphthong in the word home, which in the 
South of England may be rendered as [au] by the younger generation 
and something more like [ou] by the older people. The speech of any 
community may, therefore, be said to reflect the pronunciation of the 
previous century and to anticipate that of the next. 


6.2.3 Sound Change and the Linguistic System 


It is convenient to study sound change in terms of the development of 
particular phonemes or sounds, but it is misleading to ignore the rela- 
tionship of the sound units to the system within which they function and 
which may, in fact, not be changing. In other words, although there 
may be considerable qualitative changes, the number and pattern of the 
terms within the system may show relative stability. The ME /i:/ 
phoneme, for instance, is now realized as [at], but there is still a 
phonemic opposition which contrasts such words as tine, team, tame, 
term, tomb, etc., and, in any case, a new phoneme /i:/ has emerged in 
words of the team type. On the other hand, the system may change 
because a sound, without itself changing, may receive a new, phonemic, 
value, e.g. the sound [n] has always existed in English as a realization of 
/n/ followed by the velars /k/ or /g/, but when the final /g/ in a word 
like sing was no longer pronounced, /n/ contrasted significantly with 
/n/ and /m/. 

Since the system of our language consists of a framework of signi- 
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ficant oppositions by means of which we communicate, it may be 
assumed that there is a tendency for the system to remain stable, the loss 
of an opposition involving a possibility of confusion. In fact, of course, 
the redundancy of English is such that some degree of neutralization of 
phonemes is easily tolerated: thus, to-day, few speakers in the South of 
England distinguish saw and sore by means of an opposition /9:/-/92/, 
yet the loss of the /33/ diphthong is no impediment to communication. 
An example of an earlier coalescence of vowel phonemes is that illus- 
trated by the homophony of meet and meat. On the other hand, new 
oppositions may emerge in the language, e.g. the phonemes /v, ð, z, n/, 
as we have seen. Nevertheless, despite the adjustments in the number of 
phonemes which have taken place, the history of the English sound 
system displays, over the last 1,000 years, a considerable degree of 
Stability. 

Though the relationships within the system may tend to remain 
stable, a change of phonetic realization of any phoneme is likely to have 
qualitative repercussions throughout the system. Such a disturbance 
may be observed in modern English. The phonetic relationship of the 
vowel phonemes in set and sat, in one type of pronunciation, is of a 
front vowel between half-close and half-open to a front vowel between 
half-open and open. If, however, the vowel of sat has a closer articula- 
tion than that described, that of sef must be raised, too. A limit of 
raising is imposed by the presence of sit and seat, for it is not possible to 
raise the vowel of sit to any extent without danger of confusion with 
that of seat, unless the latter vowel becomes strongly diphthongal. (It 
may be objected that a quantitative as well as qualitative difference dis- 
tinguishes /i:/ from /1/; but, in the examples given—seat and sit—the 
phonetic context imposes a quantity on /1:/ which is practically the 
same as that of /1/. If /1/ were too close to the region of /1:/, the 
opposition would be maintained only by realizing /1: / as fully long at 
the expense of the shortening influence of the final /t/ or by a process 
such as diphthongization.) Alternatively, if the vowel phoneme of satis 
realized as a front open vowel, as in many English regional dialects, the 
vocalic area in which the phoneme of set can be realized becomes more 
extensive; in fact, in those kinds of English where this occurs, the vowel 
in set tends to be half-open variety. Such considerations of the phonetic 
relationship of phonemes have a relevanice in the historical, diachronic, 
study of English. In ME there were, for instance, four long vowels in 
the front region—/i:, e:, €:, a:/. By 1600 /1:/ had diphthongized and 
the remaining vowels closed up. Such a movement may have been 
caused by pressure upwards from /a:/ or by the creation of an empty 
space brought about by the diphthongization of the pure vowel /1:/. 

Although, therefore, it is often convenient in diachronic studies to 
investigate the development of individual phonemes in terms of the 
quality of their realization, it is clear that many sound changes can be 
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explained only by reference to a readjustment of the phonetic relation- 
ships of the phonemes of the system as a whole. Moreover, any 
particular point in the development of a language’s sound system Is not 
simply to be considered as a stage in the process of change of a number 
of sound units but rather as the presentation of the functioning of a 
system at a certain historical moment. The primary significance of the 
sounds of modern English is their function tn the system of to-day; in 
the same way, the English sounds of 1600 are to be viewed in terms not 
only of their past and future forms but also of their contemporary, 
synchronic, relationships and functions. 

Some sound changes are, indeed, the result of an influence which 
applies to the system as a whole. A prosodic, or supra-segmental, 
feature such as the stress-accent provides an example of this kind. 
English has always possessed a strong stress-accent, certain syllables of 
a word or utterance being made more prominent than their neighbours 
because of, amongst other things, the greater force with which they are 
said. Those drastic changes of vowel quality known as the Great Vowel 
Shift mainly affect vowels in accented syllables. But vowels in most 
unaccented syllables (especially those in word final positions) have 
undergone, in the last thousand years, an equally striking, though dif- 
ferent, type of change. Henry Sweet has called OE the period of full 
endings, stanas being realized as ['sta: nas]; ME, the period of levelled 
endings, when stones was pronounced ['sto: nas]; and eModE and later 
English, the period of lost endings, when stones is [sto: nz], [staunz]. 
There is, therefore, a general tendency for all unaccented vowels to 
shorten (1f long) and to gravitate towards the weak centralized vowels 
[1] or [a], or sometimes fu], if not to disappear altogether. This fact 
accounts for the high frequency of occurrence of [1] and [9] in PresE and 
for the complete elision of many vowels in unaccented syllables in rapid 
colloquial speech, e.g. suppose [s pauz], probably ['probbli]. 


6.2.4 Sources of Evidence for Reconstruction 


Whether our aim is to reconstruct the phonological system of English at 
any particular moment in history or to estimate the nature of the 
development affecting particular phonemes, it is necessary to establish 
the sound values which were used in the pronunciation of the language 
—relative values in the case of the system, absolute values as far as 
possible in the case of sound development. An investigation of the 
phonological structure of PresE would have to include direct observa- 
tion of its phonetic features. For this purpose, future generations will 
have the benefit of recordings of the speech of to-day. Obviously this 
type of evidence cannot be used for the reconstruction of past states of 
the spoken language. The further back we go into history the scantier 
the evidence of spoken forms becomes. Our conclusions will, therefore, 
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be based on information mostly of an indirect kind; yet such is the 
agreement generally amongst the various types of evidence that the 
broad lines of sound change can be conjectured with reasonable 
certainty. 

(1) Theoretical paths of development.—If, in dealing with the 
changing realization of a particular phoneme, we can be reasonably 
sure of its sound value at two points in history, we can, from our knowl- 
edge of phonetic possibilities and probabilities, infer theoretically the 
intervening stages of development. We can, of course, be sure of the 
pronunciatron of PresE. If, then, the evidence suggested unequivocally 
that, for instance, the vowel in home was pronounced as [a:] in OE, the 
development to be described and accounted for would be [a:] > [au]. It 
is likely that the articulation has always involved the back, rather than 
the front, of the tongue; the change has clearly meant a closing of the 
tongue position, to which at some stage there has been added a gliding 
(diphthongal) movement. We might, therefore, postulate such develop- 
ments as [a : > au > 3u > OU > du] or [a: > 9: > o: > OU > au]. The 
available evidence will then confirm or refute the hypothesis—in this 
case the second solution being more in keeping with the information. 
Such recognition of phonetic probabilities will always be implicit in the 
tracing of change. It must be considered unlikely that [a:] on its way to 
[ou] or [au] would have passed through a stage of front articulation, 
without any combinative influence. Nevertheless, the possibility of a 
type of change which is not the most probable theoretically must never 
be excluded, for often the evidence points to only one conclusion. It 
would, therefore, be dangerous to predict, merely according to 
phonetic probablities, the way our present sound system will develop. 
The half-close back rounded ME [u] has generally developed in PresE 
to an advanced back half-open unrounded vowel— [1]; but this vowel 
now, in the South of England, appears to have been fronted to some- 
thing nearer [a] or [á]. Its future must be quite uncertain. 

(2) Old English.—It is most important in an investigation of the 
development of English sounds over the last thousand years that the 
pronunciation of OE should be established with some certainty. If this 
can be done, we shall have a ‘starting point’ for the phonetic route of 
change to PresE. The term Old English, however, spans a period of 
some four hundred years from about A.D. 700 to A.D. 1100. Moreover, 
the invasion of the Angles, Saxons, and Jutes in the fifth and sixth 
centuries introduced four separate varieties of English: the Angles, in 
the Midlands, north-east England, and the south of Scotland, using 
types of English known as Mercian and Northumbrian (or, in general 
terms,-Anglian); the Saxons, in the south and south-west, using the 
West-Saxon dialect; and the Jutes, settling mainly in the region of Kent 
and using a dialect called Kentish. Of the four dialects, West-Saxon, 
which was to become a kind of standard language, is the one about 
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which most is known from the extant texts. In its later form—that in use 
between about A.D. 900 and A.D. 1100—it is referred to as Classical OE. 

The broad lines of the pronunciation of this language can be con- 
jectured from a comparison of the development of the other members 
of the West Germanic group of languages to which it 1s related. But by 
far the most explicit evidence concerning its sounds is to be inferred 
from the alphabet in which it is written. The earlier Runic spelling was 
replaced by a form of the Latin alphabet. This alphabet was probably 
introduced into the country in the seventh century by Irish missionaries. 
It can be assumed, therefore, that the sounds of OE were represented as 
far as possible by the Latin letters with their Latin values, with some 
modifications of an Irish kind. A great deal is known about the pro- 
nunciation of Vulgar Latin, whose sound system had much in common 
with that of modern Italian. If an Italian, knowing no English, were to- 
day asked to write down with his own spelling the PresE pronunciation 
of the word mi/k, [mitk], he would have no difficulty in representing the 
first sound, which he could spell as m; the vowel [1] might, however, 
seem to him to resemble the sound he would write in Italian as e rather 
than as /; the ‘dark’ [t] would appear to have a back vowel glide accom- 
panying it, requiring a spelling such as o/; and, since he has no & letter, 
he would spell the final [k] as c. His transcription of the word might, 
therefore, be meolc, whichis, in fact, a West-Saxon spelling of the word 
now written milk. This is a fortuitous example and must not be taken to 
suggest that OE was pronounced in the same way as PresE. But it does 
demonstrate that OE spellings, which may appear to be very different, 
are often less surprising when we keep in mind the Latin values 
originally attached to the letters. 

Sometimes the simple forms of the Latin alphabet were evidently 
inadequate for representing the English sound: thus, the joined form «e 
was used to symbolize a sound between C[a]' and C[e]; the sounds [6] 
and [0] were written in the earlier manuscripts as th initially and d 
medially and finally in a word, and later as ð or the rune p, regardless of 
the sound’s position in the word or its voiced or voiceless quality; the 
rune P frequently replaced the earlier u or uu. The vowel values of the 
OE system were particularly difficult to represent with the five Latin 
vowel letters. Sometimes the spelling used will hesitate between two 
letters: thus, the vowel of mann, probably of a C[a] or [o] quality, is 
written either with a or o, indicating a vowel between the open central 
unrounded value of the Latin letter a and the back half-open to half- 
close rounded value of o. Unaccented vowels, too, already beginning to 
be obscured and levelled, presented a problem to the scribes, the Latin 
alphabet offering no way of showing a central vowel of the [ə] type. 


! Henceforth a cardinal vowel value will be referred to as C, followed by the appro- 
priate symbol. 
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Unaccented Œ, e, and í soon begin to be written as e and unaccented a, 
u, O later tend to be used indifferently, indicating that the vowel 
distinction was being lost. A diphthong such as the one written as ea 
must probably be interpreted as a glide to a central [a] quality. 

Quantity is often shown in the case of vowels by doubling the letter or 
by the use of an accent and in the case of consonants by doubling the 
letter. The stress-accent in a word is also sometimes shown by the use of 
a mark; but, in any case, it is agreed, from a comparison of the West 
Germanic languages, that the word accent in OE fell generally on the 
first syllable of words, with the exception of certain compounds. 

The written form of OE provides us, therefore, with considerable 
information concerning the language’s pronunciation; we have a 
working hypothesis from which to begin our investigations. The study 
of later forms of English will often, in fact, confirm that the OE pro- 
nunciation postulated from the spelling and the comparison of 
Germanic languages is the only one from which later forms can be 
expected to have developed. 

(3) Middle English.—Spelling forms can also help us to deduce the 
pronunciation of the ME period, roughly A.D. 1100—1450. Generally 
speaking, it may be said that the letters still had their Latin values and 
that those letters which were written were meant to be sounded. Thus, 
the initial k in a word such as knokke was still pronounced and the 
vowel in time would have an [i] quality. This persistence of Latin values 
in spelling was no doubt due to the influence of the Church, which was 
still the centre of teaching and writing, and the absence of a thoroughly 
standardized spelling accounts for its predominantly phonetic char- 
acter. However, English spelling was modified by French influences. 
Notably, the French ch spelling was introduced to represent the [tf] 
sound in a word such as chin (formerly spelt cinn), where the new 
spelling form indicates no change of pronunciation; in addition ou, or 
ow, represents the sound [u], formerly written u, e.g. hous, in OE hus. 
The simple u spelling was retained to express both the French sound [y] 
in words like duke and fortune and the OE short [u] sound, though this 
latter sound is often written as o, especially when juxtaposed to letters 
of the w, m, n type, e.g. wonne rather than wunne, to avoid confusion 
between the letter shapes. j 

Rhymes, too, have their value, especially as, in this period, they are 
likely to have been satisfactory to the ear as well as to the eye—in the 
whole of Chaucer’s work, for instance, there are very few rhymes which 
appear to involve the pairing of different vowel sounds. Nevertheless, 
evidence from rhymes is valueless unless it is possible to be certain, 
from other sources of evidence, of the pronunciation of one member of 
the pair. Thus, the Chaucerian rhyme par cas :: was, because we can be 
sure that the French word cas had a vowel of the [a] quality, is evidence 
to confirm the view that the [w] of was had not yet retracted and 


78 An Introduction to the Pronunciation of English 


rounded the vowel to [b] and, the final s in the two words being still 
likely to represent [s], that the word was probably pronounced [was]. 

Again, words imported from French can give us information con- 
cerning the timing of sound changes. Thus, French words such as age 
and couch, which we know from French sources had [a:] and [u: ] at the 
time of their introduction into English, fell in with the English vowel 
development [a:] > [ei] and [u:] > [au] in words like name and house; 
we can conclude, therefore, that at the time the French words came into 
the language the [a:] and [u: ] vowels had not begun their change. 

Moreover, after the ME period, as we shall see, a great deal of direct 
evidence is available to us, so that our conjectures from about 1500 
onwards can be made with considerable certainty. We may often, there- 
fore, be able to deduce from our knowledge of pronunciation in the six- 
teenth century the stage probably reached in the ME period in the 
development of a sound from OE. The OE [i:] sound in time, for 
example, was beginning to be diphthongized generally very early in the 
sixteenth century. It is reasonable to suppose (even if other evidence to 
support the theory did not exist) that time still had a relatively pure [1: ] 
for much of the ME period. 

Finally, the metre of verse reveals the stress accent of words. It is for 
this reason that we know that French words, in Chaucer’s verse, gener- 
ally retained their original accentual pattern, e.g. courage [ku ra: d3a], 
and that the accent shift in these cases is a phenomenon of at least late 
ME. 

(4) Early Modern English.—The same sources of evidence which we 
have already considered remain available for the eModE period, 
roughly A.D. 1450—1600. The introduction of printing brought stan- 
dardization of spelling and already the spoken and written forms of the 
language were beginning to diverge. But individuals, especially in their 
private correspondence, often used spellings of a largely phonetic kind, 
in the same unsophisticated and logical way that children still do. If a 
modern child writes He must have gone as He must of gone, he is only 
representing the phonetic identity of the weak forms of have and of 
([əv]), an identity which he will learn to ignore when he adopts the 
conventional spelling distinction. In the same way, if fifteenth- and 
sixteenth-century spellings show the word sweet occasionally written as 
swit, 1t may be assumed that this original ME [e:] was by now so close 
that it could be represented by / with its Latin value. Or again, the 
spelling form sarvant instead of servant reflects an open type of vowel 
in the first syllable which was current throughout the eModE period in 
such words. Moreover, the conventional adoption of an unphonetic 
spelling can sometimes provide us with positive evidence as to its value: 
thus, when words like delight (formerly delite) began to be spelt with 
gh, this spelling form gh clearly no longer had the consonantal fricative 
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value which it had formerly represented in light, since there never was a 
consonantal sound between the vowel and final [t] in delight. We may 
conclude, therefore, that gh no longer had its former phonetic signi- 
ficance in words such as light. Care must, of course, be taken to identify 
the increasing number of learned or technical spellings adopted by 
printers. The initial letter group gh in ghost (OE gast) indicates no 
change in pronunciation—goose was also sometimes spelt ghoose in 
this period. Again, spellings which aim at revealing the etymology (true 
or false) of a word must usually be discarded as phonetically valueless, 
e.g. debt, island. It is above all from the writings of individuals that 
some general indications concerning sound changes may be gathered 
and used to supplement evidence derived from other sources. 

Rhymes, too, continue to be useful as complementary evidence. A 
rhyme such as night :: white confirms the view that postvocalic gh no 
longer had a consonantal value; or again, can :: swan suggests that the 
rounding of [a] after [w] had not yet taken place. Yet, just asin the case 
of ME, rhymes must be treated with caution, more particularly as eye- 
rhymes were doubtless beginning to become more prevalent. There is, 
however, in Elizabethan literature, additional evidence afforded by the 
frequent use of puns, which usually rely for their effect upon similari- 
ties, if not identities, of phonetic value. Shakespeare, for instance, 
plays on the phonetic identity of such pairs as suitor, shooter (both 
capable of being pronounced [fu:tar]) and known, none (both [no:n])); 
such puns suggest that the pronunciation of the two words was 
commonly sufficiently close to make an immediate impression upon an 
audience. 

The most important and fruitful evidence for this period is, however, 
of a direct kind. It is provided by the published works of the contempo- 
rary grammarians, orthoepists, and schoolmasters, some of whom 
have been mentioned above (§6.1 ef seg.). They are of unequal value 
and their statements have often to be interpreted in the light of other 
evidence; yet they provide us with the first direct descriptive accounts of 
the pronunciation of English. From the sixteenth century onwards, our 
conclusions rely more and more on their descriptive statements and less 
on clues of an indirect kind. Sometimes there appears to be a conflict 
between the phonetic probabilities, the statements of grammarians, 
and evidence from other sources. Frequently the solution must be that 
there existed at any time a variety of current pronunciations, resulting 
from differences of dialect, generation, fashion, and place in society, in 
the same way that a description of PresE (even that of a restricted area 
such as the south of England) would have to take into account a large 
number of variants. 

The following representative systems are conjectures of one possible 
set of phonemes current in the period in question. 
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6.2.5 Classical OE Sound System 


Vowels 121,13 y2,y ur,U 
ae Or) 
E E 
a:, a (allophone [p] before nasal consonants) 
[ə] occurs in certain weakly accented syllables. 
Diphthongs €13,63, 6:13, 63 
Consonants p,b,t,d, k,g (allophone [y] 
tas 
m, n (allophone [n] before velar consonants) 
leah 
f, 8, s (medial allophones [v, ð, z]) 
$, h (allophones [x, ç) 
j, W 

Consonants may be long or short. 

The spellings An, hl, hr, hw may be interpreted as phoneme sequences 
/h/ + /n, l, r, w/; alternatively, if it is assumed that / is here an indica- 
tion of voiceless [n, 1, r, w], these four sounds may be counted as con- 
trastive, 1.e. of phonemic status. 


Text (St. John, Chapter 14, verses 22, 23) 

22. ‘ju:das 'kwze8 to: him. næs na: se: 'skariot. ‘drictan, 'hweet Is 
jo wordan Get Ou: wilt Be: 'sylfnə jaswutelijan us næs 
middaneordo. 

23. se: 'hee:land 'pndswaroda ond ‘kweeé him; jif hwa: me: ‘luvaé he: 
‘hilt mi: nə 'spræ:tfə ond mi:n 'fædər luvaé hina ond we: ‘kumaé 
to: him pnd we: 'wyrk1a8 'eordungsto: wa mid him. 

Authorized Version 

22. Judas saith unto him, not Iscarioth, Lord, how 1s it that thou wilt 
manifest thyself unto us, and not unto the world? 

23. Jesus answered and said unto him, If a man love me, he will keep 
my words; and my Father will love him, and we will come to him, 
and make our abode with him. 


m 


6.2.6 ME Sound System 


Vowels 1:, 1 0 
e: oO: 
Gite O 


a:,a a: 
[ə] occurs in unaccented syllables. 
Diphthongs £i, (Æi), 31, lu, (eu), eu, du, (au) 
Consonants PO Cada R de ues 
m, n (allophone [n] before velar consonants) 
Ret 
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f, v, 8, 0, s, z, f, h (allophones [x, çl) 
j, w (allophone [m] after /h/) 
Text (from the Prologue to the Canterbury Tales)! 
'hwan Bat 'a:pril ,wi his 'fuzrəs 'soztə 
8ə 'druxt of 'martf ha@ 'persad ,to: da 'ro:to, 
and 'ba:dad 'e: vri 'veein in 'switf li'kuir 
of ‘hwitf ver'tiu en d3endard s 8ə 'flu:r, 
hwan ,zefirus e:k ,wi his 'swe:to 'bre:8 
n'spi:rad 'ha8 in 'e:vri ‘holt and ‘he:6 
83 'tendar 'kroppas, ¡and da ‘junga 'sunna 
'ha8 in da ‘ram his ‘halva 'kurs 1'runna, 
and 'sma:lo 'fu:los 'ma:kon ¡melo dita, 
Oat 'slezpan 'a:1 da 'nigt w18 '9: pan 'i:9— 
so: 'prıkəð ‘hem na: tiur ın 'hir ku'ra:d3os— 
8an lo: ngan folk to: 'go:n on ,pilgri'ma:d3es. 


6.2.7 Early ModE Sound System 


Vowels i:,1 uz, U 
67 O:, Y 
Ene 9 
e Di, D 
/e:/ was probably /i:/ or /e:/ in certain types of pro- 
nunciation 


[a] and [a:] occur as contextual variants of /æ/ and /D:/. 
Diphthongs 31, au, lu (or ju), eu, Ou, 91, ul, €l 
Consonants p,b,t,d,k, g, tf, d3 
m,n,n 
er 
f, v, 8, ð, s, z, f, 3 (later, in medial positions), h 
j, w (allophone [m] after /h/) 
Text (Macbeth, Act Il, Scene 1) 
nou o:ar ğə wyn ha: f wyrld 
ne:tar si: mz ded, and wikid dre: mz abju:z 
da kyrteind sli: p: witfkraft selibre its 
peil hekots pfarinz: and widard myrdar, 
alaramd bai hız sentinal, da wulf, 
hu:z houlz hız watf, dvs w18 hız stel@1 pels, 
w18 tarkwinz reevifin straidz, tuzardz hız dizain 
mu:vz laik ə go:st. dau sju:r? and ferm-set er@ 
he:r not mai steps, hwitf wei Óe1 work, for fe: r 


1 The type of transcription given here is slightly archaic for Chaucer’s pronunciation, 
e.g. long consonants were probably lost in later ME and such words as and, that, would 
have had a weak vowel. 

2 Alternatively, [f] or [fj] for (sj). 
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Odi ver! Sto:nz pre:t av mol hwe:rabout, 
ond te:k 6a prezant horar from 6a toim, 
hwitf nou sju:ts! wid it. 


6.2.8 PresE Sound System 


Vowels 1:,1 U, u: 
e 
J > 
e 
A 
a: D 


Diphthongs ei, 3U, al, au, 31, 19, €9, Ud 
Consonants p,b,t,d,k,g 

tf, d3, (tr, dr) 

m, n, y 

l, r 

Evo Oe Mean [eee |) 

j, W 


6.2.9 Modifications in the English System? 


l. Distribution of phonemes.—The similarities of the systems given 
above may obscure the fact that the same sound, especially as far as the 
vowels are concerned, may occur in different categories of words 
according to the period. Thus [u:], now in food, occurred in OE in 
words such as town; [i:], now in team, occurred in OE in time. The 
following summary shows some of the most striking changes affecting 
the vowel quality used in particular word categories:— 


OE ME eModE PresE 
time i i: a] al 
sweet e: e: i i 
clean æ: el ez (or [1:)) E 
stone a: 3: O: 3U 
name a a: El el 
moon 0: O: us u: 
house u: ul ou au 
love U U ¥ A (or [á]) 


2. Vowel changes. —Several trends become apparent from a study of 
quality changes: — 


1 See footnote 2 on previous page. 

21t should be noted that some present-day regional accents preserve distinctions or 
realizations manifested at earlier stages in the history of the language. See J. C. Wells, 
Accents of English, Cambridge University Press, 1982. 
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(a) OE long vowels have closed or diphthongized; on the other hand, 
PresE [au] and [ei] show signs of monophthongization. 

(0) Certain phonemic qualitative oppositions have coalesced, e.g. 
OE /e:/ and /e: /; the originally separate diphthongs of day and way; 
the diphthong of know with the originally pure vowel of no; the 
diphthongs of day, way with the former pure vowel of name; OE 
/y:, y/ with /1:, 1/ (or 7E/), 

(c) Short vowels, with the notable exceptions of the OE /a, z/ (and 
the short diphthong /e3/) in open syllables and ME /u/, have remained 
relatively stable. 

(d) Rounded front vowels have been lost, e.g. OE /y:, y/ and earlier 
ro, @/. 

(e) The loss of post-vocalic [r] in the eighteenth century gave rise to 
the PresE centring diphthongs /19, ea, ua/, the pure vowel /3:/ and 
introduced /a:, 9: / into new categories of words (cart, port). 

($) Vowels under weak accent are increasingly obscured to [a] or [1], 
or are elided. 

(g) Changes of quantity affected certain phonemes in particular con- 
texts or sets of words, e.g. lengthening of OE /a, «e, €a/ in open 
syllables and of ME /a/ + /f, 8, s/; and shortening of ME /o:/ in 
words like good, book, blood, and of ME /e: / in such words as breath, 
death, head. 

3. Consonant changes.—Changes in the consonantal system are less 
striking, but the following may be noted: — 

(a) Double (or long) consonants within words were lost by late ME; 
certain other consonant clusters cease to be tolerated, e.g. /hl, hr, hn/ 
by ME and /kn, gn, wr/ in the eModE period; post-vocalic /r/ was lost 
in much of the south-east of England in the eighteenth century. 

(b) Allophones of certain phonemes have been lost, e.g. the [y] allo- 
phone of /g/ in late OE and the [x, ç] allophones of /h/ in eModE. 

(c) New phonemes have emerged, e.g. /tf, d3/ in OE, /v, 0, z/ in 
ME, and /n, 3/ in eModE; in addition, /h/ is used initially in words of 
French origin where, originally, no [h] sound was pronounced (habit, 
herb, humble, etc). 


6.3 Standards of Pronunciation 


The English are to-day particularly sensitive to variations in the pro- 
nunciation of their language. The ‘wrong accent’ may still be an 
impediment to social intercourse or to advancement or entry in certain 
professions. Such extreme sensitivity is apparently not paralleled in any 
other country or even in other parts of the English-speaking world. 
There are those who claim, from an elocution standpoint, that modern 
speech is becoming increasingly slovenly, full of ‘mumbling and 
mangled vowels and missing consonants’. Alexander Gil and others 
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made the same kind of complaint in the seventeenth century. There is, 
in fact, no evidence to suggest that the degree of obscuration and 
elision, often characteristic of a dynamic stress language such as 
English, is markedly greater now than it has been for four centuries. Of 
more significance—social as well as linguistic—is the attitude which 
regards a certain set of sound values as more acceptable, even more 
‘beautiful’ than another. Judgments of this kind suggest that there is a 
standard for comparison; and it is clear that such a standard pronuncia- 
tion does exist, although it has never been explicitly imposed by any 
official body. A consideration of the origins and present nature of this 
unofficial standard goes some way towards explaining the contro- 
versies and emotions which it arouses at the present day. 


6.3.1 The Emergence of a Standard 


It is clear that the controversy does not centre around the written 
language: the spelling of English was largely fixed in the eighteenth 
century; the conventions of grammatical forms and constructions as 
well as of the greater part of our vocabulary have for a long time been 
accepted and adhered to by the majority of educated English speakers. 
Indeed, the standardization of the written form of English may be said 
to have begun in the ninth and tenth centuries. But there has always 
existed a great diversity in the spoken realizations of our language, in 
terms of the sounds used in different parts of the country and by 
different sections of the community. On the one hand, the sounds of the 
language always being in process of change, there have always been at 
any one time disparities between the speech sounds of the younger and 
older generations; the advanced speech of the young is traditionally 
characterized by the old as slovenly and debased. On the other hand, 
especially in those times when communications between regions were 
poor, it was natural that the speech of all communities should not 
develop either in the same direction or at the same rate; moreover, dif- 
ferent parts of the country might be exposed to different external 
influences (e.g. foreign invasion) which might influence the phonetic 
structure of the language in a particular area. English has, therefore, 
always had its regional pronunciations in the same way that other 
languages have been pronounced in a variety of ways for basically geo- 
graphical reasons. Yet, at the same time, especially for the last five 
centuries, there has existed in this country the notion that one kind of 
pronunciation of English was preferable socially to others; one regional 
accent began to acquire socia/ prestige. For reasons of politics, com- 
merce, and the presence of the Court, it was the pronunciation of the 
south-east of England, and more particularly to that of the London 
region, that this prestige was attached. The early phonetician John Hart 
notes (1569) that it is in the Court and London that ‘the flower of the 
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English tongue is used’. . . ‘though some would say it were not so, 
reason would we should grant no less: for that unto these two places, do 
daily resort from all towns and countries, of the best of all professions, 
as well of the own landsmen, as of aliens and strangers .. .’. Putten- 
ham’s celebrated advice in the Arte of English Poesie (1589) recom- 
mends the ‘usual speech of the Court, and that of London and the shires 
lying about London within 60 miles and not much above. . . . Northern 
men, whether they be noblemen or gentlemen, or of their best clerks, 
(use an English) which is not so courtly or so current as our Southern 
English is’. Nevertheless, many courtiers continued to use the pronun- 
ciation of their own region; we are told, for instance, that Sir Walter 
Raleigh kept his Devon accent. The speech of the Court, however, 
phonetically largely that of the London area, increasingly acquired a 
prestige value and, in time, lost some of the local characteristics of 
London speech. It may be said to have been finally fixed, as the speech 
of the ruling class, through the conformist influence of the public 
schools of the nineteenth century. Moreover, its dissemination as a 
class pronunciation throughout the country caused it to be recognized 
as characteristic not so much of a region as of a social stratum. With the 
spread of education, the situation arose in which an educated man 
might not belong to the upper classes and might retain his regional 
characteristics; on the other hand, those eager for social advancement 
felt obliged to modify their accent in the direction of the social 
standard. Pronunciation was, therefore, a marker of position in 
society. 


6.3.2 The Present-day Situation 


(1) Great prestige is still attached to this implicitly accepted social 
standard of pronunciation. Often called Received Pronunciation (RP), 
the term suggesting that it is the result of a social judgment rather than 
of an official decision as to what is ‘correct’ or ‘wrong’, it has become 
more widely known and accepted through the advent of radio. The 
BBC formerly recommended this form of pronunciation for its 
announcers mainly because it was the type which was most widely 
understood and which excited least prejudice of a regional kind. 
Indeed, attempts to use announcers who had a mild regional accent 
provoked protests even from the region whose accent was used. Thus, 
RP often became identified in the public mind with ‘BBC English’. This 
special position occupied by RP, basically educated Southern British 
English, has led to its being the form of pronunciation most commonly 
described in books on the phonetics of British English and traditionally 
taught to foreigners. 

(2) Nevertheless, it cannot be said that RP is any longer the exclusive 
property of a particular social stratum. This change is due partly to the 
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influence of radio in constantly bringing the accent to the ears of the 
whole nation, but also, in considerable measure, to the modifications 
which are taking place in the structure of English society. Just as the 
sharp divisions between classes are beginning to disappear, so the more 
marked characteristics of regional speech and, in the London region, 
the popular forms of pronunciation, are tending to be modified in the 
direction of RP, which is equated with the ‘correct’ pronunciation of 
English. This tendency does not, as yet, mean that regional forms of 
pronunciation show signs of disappearing, but it has to be recognized 
that those who wish, for any reason, to modify their speech have 
models of RP always readily available to their ears while, at the same 
time, the social inhibitions concerning movement between classes, 
which were formerly so strongly operative, no longer, in the face of the 
standardization of society as a whole, exert the same pressure. 
Moreover, it must be remarked that some members of the present 
younger generation reject RP because of its association with the ‘Estab- 
lishment’ in the same way that they question the validity of other forms 
of traditional authority. For them a real or assumed regional or popular 
accent has a greater (and less committed) prestige. It 1s too early to 
predict whether such attitudes will have any lasting effect upon the 
future development of the pronunciation of English. But, if this 
tendency were to become more widespread and permanent, the result 
could be that, within the next century, RP might be so diluted that it 
could lose its historic identity and that a new standard with a wider 
popular and regional base would emerge. Such a change is made more 
likely through the recent more permissive attitude of the BBC (and of 
the commercial television companies) in their choice of announcers, 
several of whom now have markedly non-RP or non-British accents. 
(3) Certain types of regional pronunciation are, indeed, firmly 
established. Some, especially Scottish English speech, are universally 
accepted; others, particularly the popular forms of pronunciation used 
in large towns such as London, Liverpool, or Birmingham, are 
generally characterized as ugly by those (especially of the older genera- 
tions) who do not use them. This rejection of certain sounds used in 
speech is not, of course, a matter of the sounds themselves: thus, [paint] 
may be acceptable if it means pint, but ‘ugly’ if it means paint. It is 
rather a reflection of the social connotations of speech which, though 
they have lost some of their force, have by no means disappeared. 
Indeed, RP itself can be a handicap if used in inappropriate social situa- 
tions, since it may be taken as a mark of affectation or a desire to 
emphasize social superiority.’ It may be said, too, that if improved 
communications and radio have spread the availability of RP, these 
Same influences have rendered other forms of pronunciation less 


l For accounts of attitudes to RP, see works by Giles in the bibliography. 
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remote and strange. An American pronunciation of English, for 
instance, is now completely accepted in Britain; this was not the case at 
the time when the first sound films were shown in this country, an 
American pronunciation then being considered strange and even diffi- 
cult to understand. Speakers of RP are becoming increasingly aware of 
the fact that their type of pronunciation is one which is used by only a 
very small part of the English-speaking world. 

(4) A comparison of two regional or social types of pronunciation 
will reveal differences of several kinds (as mentioned in $5.3.6):— 

(a) realizational differences the system, 1.e. the number of distinc- 
tive (phonemic) terms operating, may be the same, but the phonetic 
realizations of the phonemes may be different: e.g. the RP opposition 
between the vowels of bet and bat may be maintained, but the realiza- 
tion of both vowels is much more open than in RP (see §7.9.4), so that 
the sound of /z/ may come near to that of one type of RP /aA/ (see 
§7.9.5); or when, as in Cockney, an allophone [?] in unaccented 
positions represents /t/ and often /k/ or /p/ (see §8.2.7); or when the 
final allophone of /1/ is [l] rather than [tł] (see §8.7); 

(b) systemic differences (i.e. differences in phoneme inventory) The 
system may be different, 1.e. the number of oppositions may be smaller 
or greater: e.g. the RP /z/—/a:/ opposition may not be present in 
those Ulster or Scottish forms which do not distinguish Sam and psalm; 
or when RP /ai/ homophones, as in side and sighed may be differen- 
tiated qualitatively or quantitatively, as in some types of Scottish 
English; or when, as in some forms of RP, /a: / levels with /ai/ and 
/au/ before /3/ (see $7.11); or when the presence of /g/ after [n] in such 
a word as sing deprives [n] of its phonemic status (see $8.6.2); 

(c) lexical differences (1.e. differences of lexical incidence) The 
system may be the same, but the incidence of phonemes in words is 
different: e.g., in those Northern forms which have the RP opposition 
/u: /—/u/, but nevertheless use /u: / in book, took, etc. (see §§7.9.9— 
10); or when /D/ is used instead of /A/ in one, among, etc., though the 
opposition /p/—/a/ exists (see §7.9.5); or when'the choice of phoneme 
is associated with the habits of different generations, e.g. /9: / for /D/ 
in off, cloth, cross, etc. (see §7.9.7) or /e1/ for /1/ in Monday, holiday, 
cic. 

(d) distributional differences The system may be the same, but the 
phonetic context in which certain phonemes occur may be limited: e.g. 
in RP /r/ has a limited distribution, being restricted in its occurrence to 
pre-vocalic position as in red or horrid. Accents which display this 
limited distribution of /r/ are referred to as non-rhotic accents, whilst 
those in which /r/ has a full distribution (such as most American and 
Scottish accents) are termed rhotic. In the latter accents /r/ occurs pre- 
consonantally and pre-pausally as well as pre-vocalically, thus part and 
car will be pronounced /pa:rt/ and /ka:r/ whereas in non-rhotic 
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accents the pronunciation will be /pa:t/ and /ka: / respectively. See 
§8.8. 

(5) It is possible to make some classification of the various kinds of 
British English pronunciation in the light of the regional and social 
criteria mentioned. Features of regional pronunciation, without any 
contamination from RP, will be found in highly educated and less 
educated speech (to be distinguished briefly as educated regional and 
popular regional); lack of conventional education will often, in 
addition, reveal itself by the use of non-standard grammatical and 
lexical forms. On the other hand, a regional pronunciation may be 
termed modified when it has adopted certain characteristics of RP, e.g. 
the adoption of the RP [a:] in words such as ask, after, path rather than 
[a]. In the case of the speech of the Greater London population, there 
are a great number of gradations to be noted amongst the popular 
regional forms, through educated regional to RP; such varying 
mixtures of regional and RP are typical of the suburban districts. 

Within RP itself 1t is convenient to distinguish three main types: the 
conservative RP forms used by the older generation and, traditionally, 
by certain professions or social groups; the general RP' forms most 
commonly in use and typified by the pronunciation adopted by the 
BBC; and the advanced RP forms mainly used by young people of 
exclusive social groups—mostly of the upper classes, but also, for 
prestige value, in certain professional circles. In its most exaggerated 
variety, this last type would usually be judged “affected” by other RP 
speakers, in the same way that all RP types are liable to be considered 
affected by those who use unmodified regional speech. Advanced pro- 
nunciations, however, whenever they are not the result of temporary 
fashion, may well indicate the way in which the RP system is developing 
and be adopted in the future as general RP, e.g. the originally advanced 
(“affected”) diphthong in home, involving increased centralization and 
a tendency towards monophthongization, seems likely to become 
general in a very short time. 

These various types of British English pronunciation may be sum- 
marized as follows: — 


Regional (Reg) Received (RP) 
educated (educ) conservative (cons) 
popular (pop) general (gen) 
modified (mod) advanced (adv) 


6.3.3 Notions of Correctness 


Within RP, those habits of pronunciation that are mostly firmly estab- 
lished tend to be regarded as ‘correct’ whilst innovation tends to be stig- 


! Also called mainstream RP. 
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matized. Thus conservative forms tend to be most generally accepted, 
sometimes even by those who themselves use other pronunciations. 

Where the stress patterns or the phonemic structure of disparate 
words is concerned, this attitude may result in a speaker’s use of the 
conservative variant in a formal situation and the use by the same 
speaker of a less well-established variant in more casual speech, e.g. the 
avoidance of /,veri faiabl/ (verifiable) and /‘d3uarin/ (during) in more 
formal speech and their replacement with the more conservative 
/'verifaiobl/ and /‘djuorin/. It may be of interest that the pronunciation 
/‘d3uerin/ with initial coalescent assimilation was acknowledged by 
Daniel Jones in the English Pronouncing Dictionary in the 1960’s and 
noted as long ago as 1913 by Robert Bridges in his Tract on English 
Pronunciation. Nevertheless, there is still resistance to accepting 
coalescence word-initially in stressed syllables. 

Where realizational variation (below the level of the phoneme) is 
affected, most speakers are unaware of their own changing speech 
patterns. Objections to the use of the glottal stop are often made, its use 
being popularly associated with Cockney speech, and yet its occurrence 
as a realization of preconsonantal /t/ is increasingly frequent within the 
speech of the middle and younger generations of RP speakers. (See 
§8.2.7 b.) 

It is not, therefore, possible to be certain that the resistance to accept- 
ing certain innovatory pronunciations will be overcome. For that 
reason, it is most realistic here to sketch only the more widespread 
general trends. 


6.3.4 Current Changes! 


(1) Realizational changes 


Several vowels should be mentioned here, notably /z/ with an opener 
and more retracted quality than in more conservative speech, /ea/ with 
a monophthongal realization in some environments, a tense word-final 
/1/ with a somewhat closer quality than is traditional, and /al/ and 
/au/ sharing the same central open starting point. (See Chapter 7 for 
detailed discussion of each vowel.) * 

As has been mentioned in §6.3.3, the realization of preconsonantal 
/t/ as a glottal stop is increasingly common in present-day RP. (See 
§8.2.7.) 


(2) Systemic changes 


The one recent systemic change that is now more or less completed is the 
loss of /23/ from the phoneme inventory. 


! For more detail, see Susan Ramsaran, ‘RP: Fact and Fiction’. 
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(3) Lexical changes 


There is a strong trend towards selecting /a/ instead of unstressed /1/ in 
weak syllables, the choice of /a/ being particularly favoured after /I/ 
and, more especially, /r/, e.g. angrily /zngrili/ > /ngrali/. For 
further detail and examples, see §7.9.2. 

The other noticeable trend is the replacement of /Ua/ by /9:/ in many 
common words, e.g. poor /po:/, sure /fo:/, though /ua/ still retains its 
phonemic status, its contrastive function being illustrated in the speech 
of most speakers by such sets as doer, dour, door /'du: a, dua, da: / (the 
first, as in evil-doer, serving to illustrate that [ua] is not merely a 
realization of /u:/ + /2/). 


(4) Distributional changes 


The most noteworthy trend concerning a regular change in the occur- 
rence of a phoneme is the loss of /j/ after alveolar consonants in such 
words as allude /alju:d/ > /alu:d/, luminous /'lju: minas/ > 
/‘lu:minas/, supersede /sju:pa'si:d/ > /su:pasi:d/. /j/ is most 
commonly dropped after /1/ and /s/ (as, indeed, it was long ago after 
/r/). In the case of the alveolar plosives + /j/, coalescence is now 
increasingly common except initially in a stressed syllable where /t/ + 
/j/ or /d/ + /j/ tend to be retained. Thus educate /‘edju:keit/ > 
/'ed3u:keit/, statuesque / steetju: esk/ > / steetfu:'esk/. 


(S) Stress changes 


Certain patterns may be detected, especially in the change affecting 
adjectives in -able/-ible and -ary/-ory. In both classes of words, the 
stress tends now to fall later in the word, thus applicable > applicable, 
explicable > explicable, ‘justifiable > justifiable, ‘fragmentary > 
frag mentary, mandatory > mandatory. 

Similarly, the feminine suffix -ess increasingly attracts primary stress 
in words like coun'tess, lio'ness, prio ress, stewardess. 

Other current changes do not display such regular patterns and it 
remains to be seen which variant pronunciations at present co-existent 
will persist. 


6.3.5 English Worldwide 


Finally, it has to be recognized that the role of British RP in the English- 
speaking world has changed very considerably in the last century. Over 
300 million people now speak English as a first language, and of this 
number native RP speakers form only a minute proportion; the major- 
ity of English speakers use some form of American pronunciation. 


The Historical Background 91 


However, despite the discrepancy in numbers, RP continues for histo- 
rical reasons to serve as a model in many parts of the world, and, if 
a model is used at all, the choice is still effectively between RP and 
American pronunciation. 

A disturbing development concerns the use of English in the Indian 
and African continents, where English functions as a lingua franca 
superimposed upon a large number of indigenous languages. In these 
regions, the interference of the indigenous phonological structures is 
such that the efficacy of spoken English as a means of communication 
is fast being lost; intelligibility tends to fall to a low level within quite 
restricted areas, for reasons which are primarily phonetic. Even where 
some reasonable degree of intelligibility is retained within any one 
country, there are serious deficiencies of a mainly phonological kind 
which become apparent when English is used as an international means 
of communication. It is conceivable that, if such divergencies are not 
restrained, communication will be easily maintained only in the written 
language. It is for this reason of universal importance that efforts 
should be made to relate these developing forms of English pronun- 
ciation to either a British or American model. 

When it is a question of teaching English as a second language, there 
is clearly much greater adherence to one of the two main models. Most 
teaching textbooks describe either RP or American pronunciation, and 
allegiances to one or the other tend to be traditional or geographical: 
thus, for instance, European countries continue on the whole to teach 
RP, whereas much of Asia and South America follow the American 
model (see also Chapter 12). 


7 


The English Vowels 


7.1 The Distinctive Vowels 


A large number of vowel sounds (either relatively pure or clearly gliding 
in nature) have a distinctive syllabic function in English. Their oppo- 
sitional nature may be established by the commutations possible in such 


series as:— 

heed feel 

hid fill 

head fell 

had 

hard 

hod 

hoard fall 

hood full 
fool 

heard furl 
fail 
foal 

hide file 
fowl 
foil 


bead 
bid 
bed 
bad 
bard 


board 


booed 
bud 
bird 


bayed 
bode 
bide 
bowed 


beard 
bared 


pea 
city, except 
par 


paw 


purr 
sitter, accept 


pay 
pie 
peer 


pair 
poor 


A general phonetic assessment of the qualities of syllabic vowels, in 
terms of the Cardinal values, may be made from the following exam- 
ples, in word final and non-final positions:— 


Final. Non-final. 
I. Short 

city bid 

— bed 


Quality. Notation. 
retracted, raised [e] 
between [e] and [e] e 
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Final.  Non-final. Quality. Notation. 
— had between [e] and [a] e 
— hod [o] D 
- hood advanced, raised [o] U 
bud centralized, raised [a] A 
sitter accept central = 


II. Long (relatively pure) 


pea bead lowered [i] or [1] or [13] i 
do food lowered [u] or [uu] or [uw] u: 
par bard fronted [a] a: 
paw board raised [9] a: 
purr bird mid central 3: 
Ill. Long (diphthongal glides, with prominent Ist element) 
(a) glide to [1] 
pay paid lowered [e] — [1] el 
pie hide [a] — [1] al 
coy coin [o] — [1] 91 
— ruin [u] — [1] U1)? 
(b) glide to [u] 
low load mid [9] — [u] 3U 
bough bowed between [a] and [a] — [u] au 
(c) glide to half-open [ə] 
peer beard [1] — [ə] 19 
pair bared [e] =- [9] £9 
(soar soared [9] — [ə] 93) 
poor moored [u] — [ə] Ud 
Notes 


(1) Most people will make a considerable difference of length between 
the vowels in hat, had, and bad when the words are said in isolation, the 
vowel in bad being as long as any of the ‘long’ vowels. Nevertheless, this 
length is not a constant distinctive feature of the vowel, but is rather 
dependent upon the context or 1s characteristic of the pronunciation of 
particular words. Since the short vowel is the more common and 
because the distribution of [ee] is like that of the other short vowels, the 
vowel is included in the table of short vowels. 

(2) The so-called pure vowels of bee and do frequently contain a 
glide between two distinct elements, especially in a final position. 
Nevertheless, because the qualities of the elements are phonetically 
closely related and because a non-gliding vowel is not uncommon or 


l Occuring only in unaccented syllables (see §7.9.12). 
2 [ui] may be regarded as a reduced version of /u:/ + /1/ (see 87.11). 
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thought to be un-English, these two vowels may on phonetic grounds be 
included in the ‘long, pure’ list. 

(3) Many speakers do not possess the distinction saw/soar, i.e. pure 
vowel y. diphthong, but use the pure vowel in both cases. For this type 
of pronunciation, class III (c) will be reduced by one item. 

(4) The vowel glide [v1] exemplified by such words as ruin, fluid, 
suet, is of extremely rare occurrence within one syllable, though it may 
occur as a reduced form of [u:] + the suffix -ing, 1.e. in the case of the 
juxtaposition of two syllables, e.g. in doing. The sequence [u] + [i] may 
occur across word boundaries as a result of smoothing e.g. newest 
[njuist]. (See §7.11.) Since this diphthong carries such a low distinctive 
weight, it may be omitted from the list of contrasting vowels. 

(5) Of the short vowels, it is to be noted that [a] occurs only in 
unstressed syllables. 


7.2 Vowel Glides with a Non-prominent Ist Element 


We find that the sounds (semi-vowels) [j] and [w] regularly occur in 
positions preceding most of the above basic vocalic elements: — 


[j + 1] yiddish [w + 1] wit 
Ml [w + e] wet 

[y + e] yap [w + æ] wax 

[j + o] yacht [w + o] watch 
[] + u] you (weakly stressed) [w + U] wood 

[J + a] young [w + A] won 

[J + ə] failure [w + ə] were (weakly stressed) 
[J + i:] yeast [w + i:] week 

[] + uz] youth [w + u:] woo 

[j + a:] yard [w + a:] waft 

[y + 9:] yawn [w + 9:] wall 

[j + 3:] yearn [w + 3:] word 
[j + el] yea [w + el] way 

[j + aı] [w + at] wide 

[y + 91] yoick [w + 91] quoit 
[J + 26] yeoman [w + au] woe 

[y + au] yowl [w + au] wound 
[j + ta] year [w + 19] weir 

[J + ea] (Yare) Íw + £ə] wear 

[J + va] lure [w + Ua] 


Since [j] and [w] are often purely vocalic from a phonetic standpoint, 
being rapid vocalic glides from [i] and [u] positions, it is possible to 
consider their combination with other vowels as constituting: — 

(a) a rising diphthong, in the case of [j] or [w] followed by a short 
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vowel or long, relatively pure, vowel, i.e. vowel glides in which the 2nd 
element rather than the Ist is the more prominent; 

(b) a triphthong, in the case of [j] or [w] followed by a vocalic glide, 
1.e. a vowel glide in which there are three vocalic elements, the central 
one being most prominent. 

Nevertheless, since such combinations affect almost all our pre- 
viously established basic vowels and glides, it would add enormously to 
our inventory of basic vowels if we were to include these combinations 
in our list. Moreover, these [j] and [w] elements function very much as 
1f they were consonants, marginally rather than centrally in a syllable, 
and, indeed, in cases such as the [j] in tune or the [w] in queen tend to be 
voiceless and to have the friction which is phonetically characteristic of 
a consonant. For these reasons, it is more convenient to treat initial [j] 
and [w] as separate from the vocalic nucleus of the syllable and to 
include them in the list of consonants. 


Note 


It is possible, in the same way, to express the brief [1] and [u] elements 
occurring post-centrally in the diphthongal glides as consonantal /j, w/ 
following a simple syllabic element. Thus, if [1:] and [u:] are interpreted 
as glides and [ut] similarly included, the following statement may be 
made: ua (elle— e (aul — a bil- O (007012; [au] 
= /aw/; [au] = /aw/; [u:] = /uw/, with, in addition, the following 
possibilities of /j/ or /w/ preceding and following the central syllabic 
element: /jij, wij, jej, wej, waj, Joj, WOJ, Jaw, waw, waw, juw, wuw/. In 
the present treatment of RP vowels, however, such post-central [1, u] 
elements are regarded as vocalic rather than consonantal because:— 

(1) they do not have a distribution after all vocalic elements as 
general as that which we find in the case of pre-central /], w/; 

(11) they are in RP very weakly articulated (compared with pre- 
central /j, w/) and may be realized merely as a prolongation of the 
central syllabic vowel quality, e.g. [e:] as a variant of [e1]; 

(111) they have none of the fricative (phonetically consonantal) allo- 
phones characteristic of pre-centra] /j, w/ following /p, t, k/. 


7.3 Glides to [ə] 


Similarly, glides to [9] are treated here as composite vocalic units, since 
in general RP [a] combines with a syllabic vowel element only after [1, e, 
U] and may with some RP speakers be realized merely as an extension of 
the preceding syllabic vowel element. 


96 An Introduction to the Pronunciation of English 


7.4 Vowel Length 


There are traditional relationships between short and long vowels in 
English, as illustrated by the following words:— 


bid and bead /11:/ 

good and food /U-u:/ 
cad and card AS 
cod and cord /D-9:/ 


(for)ward and word /2-3:/ 


Notes 


(a) Only in the case of /a-3:/ can there be said to exist an opposition 
solely of length and even in this case it has to be stated that /3/ occurs 
only in unaccented syllables, whereas /3:/ can occur in syllables carry- 
ing primary or secondary accent. 

(b) In the other cases the opposition between the members of the 
pairs is a complex of quality and quantity; and of the two factors it is 
likely that quality carries the greater contrastive weight. Indeed, in the 
particular case of the cad/card opposition, both vowels may be equally 
long. 

(c) In a transcription which sets out to show explicitly certain 
phonetic characteristics, 1t would seem advisable to indicate especially 
the qualitative opposition, at the same time noting quantity by means of 
the length mark. 

(d) Such insistence on purely phonetic characteristics, as compared 
with what is phonemically relevant, is justified when variations of 
vowel length are taken into consideration. In accented syllables the 
so-called long vowels are fully long when they are final or in a syllable 
closed by a lenis consonant, but they are considerably shortened when 
they occur in a syllable closed by a fortis consonant.! Thus:— 

/\:/ in beat is only about half as long as the /1:/ of bee or bead and 
may, in fact, be of approximately the same length (duration) as the /1/ 
vowel of bid; 

/u:/ in boot is only about half as long as the /u:/ of do or food and 
again has about the same duration as the /U/ vowel in good. 

This is also true of /a:, 3:/ and /3:/. If the length sign [:] is retained 
merely to show length, then it 1s possible when desirable to indicate 
phonetic variations thus: — 


bee, bead, beat, bid [bi:, bizd, bit, bid] 
do, food, boot, good  [du:. fu:d, but, gud] 
car, card, cart, cat [ka:, ka: d, kat, ket] 


! This reduction process is referred to by J. C. Wells as pre-fortis clipping. 
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caw, cord, caught, cod [ko:, ko:d, kot, kod] 
her, heard, hurt [hs:, h3:d, hat] 


A single length mark may be used to indicate a further intermediate 
duration if so desired e.g. [bi'n). 

(e) The same considerable shortening before fortis consonants 
applies also to the diphthongs:— 


cf. play, played, plate 
row, road, wrote 

tie, tide, tight 

cow, loud, shout 
boy, noise, voice 
fear, fears, fierce 
scare, scares, scarce 


Here the variation of length might, if desirable, be shown explicitly 
thus: [ple:1, plezid, pleit]. 

Since lenis /m, n, 1/ do not have fortis counterparts, the variation of 
length in the preceding vowel does not apply when they close a syllable; 
/y/ does not normally close a syllable containing a long vowel or 
diphthong. 

(f) Many phoneticians have measured the duration of English 
vowels in different phonetic contexts. A recent study of this kind is that 
of Kalevi Wiik (1965), which shows, in summary, the following average 
relationships in csecs. in accented monosyllables: 


Final or + lenis C + nasal C + fortis C 
Short vowels 17-2 13.3 10-3 
Long vowels 31-9 23-3 16-5 
Diphthongs 35-7 26-5 17-8 


(1) /æ/ is not included in the category of short vowels, because of the 
special length often associated with it (see §7.9.4), but is classified 
separately as ‘neutral’. The following measurements for /æ/ are given 
by Wiik: + lenis fricative—25-2 csecs., + lenis plosive +21-6csecs., + 
nasal—19-6 csecs., + fortis fricative—16-5 csecs., + fortis plosive— 
15-0 csecs. 

(11) An example of the relationship of two vowels traditionally paired 

(/1:/~/1/) shows the following typical descending durations:— 
/i:/ + lenis fricative—36- csecs., /1:/ + lenis plosive—28-5 csecs., 
/i:/ final—28-0 csecs., /1:/ + nasal—19-5 csecs., /1/ + lenis fricative 
— 18-6 csecs., /1/ + lenis plosive—14-7 csecs., /1:/ + fortis fricative 
— 13.0 csecs., /1:/ + fortis plosive—12-3 csecs., /1/ + nasal—11-0 
csecs., /1/ + fortis fricative—8-3 csecs., /1/ + fortis plosive—7-3 
csecs. Thus, it will be seen that /1:/ is typically shorter in a word such as 
seat (12-3 csecs.) than /1/ in a word such as Aid (14-7 csecs.). 
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7.5 Phonemic Interpretation of the Vowels 


There are a number of ways, all equally legitimate, of allocating the 
English vowels to phonemic categories. Most phonemic analyses will 
apply to only one type of English, but some attempts! have been made 
to provide categories which are valid for all kinds of English. It is 
doubtful, given the tremendous variety of pronunciations of English 
throughout the world, whether such analyses are desirable or wholly 
successful. But several solutions have been proposed for RP and of 
these no solution can be said to be more ‘correct’ than another. The 
choice of solution will depend largely on the purpose of the analysis. 
Thus, one kind of analysis will be convenient when dealing with the 
synthesis of speech, whereas others will be appropriate when it is a 
question of teaching English pronunciation to a particular nationality 
or when a comparative study of various types of English is being made. 
In particular, it is not always to be assumed that the most economical 
analysis is the most useful. 

The main variations of interpretation concern the treatment of the 
long vowels and diphthongs. Daniel Jones, for instance, lays emphasis 
upon the distinctive importance of length. His analysis of the English 
vowels is concerned not only with categories of quality (phonemes), but 
also postulates two categories of length (long and short chronemes). He 
is able in this way to distinguish the members of the pairs [i:, 1; 9:, D; 
u:, U; 3:, 3] by means of a length, or chronemic, feature, to which the 
qualitative contrast 1s subordinate. These vowels are, therefore, long 
and short members of four phonemes and the pairs may be written in 
the simplified form /1:, 1; 0:, 0; u:, u; 9:2, 9), where the sign [:] indicates 
conventionally not only length but also a related quality. It is possible 
on purely logical grounds to extend this analysis to include, despite the 
considerable qualitative separation, the pair [a:, ee], which might then 
be expressed /a:, a/. Such an analysis reduces the number of vowel 
phonemes by five. A transcription based on this analysis is simple for 
the phonetician to handle, but may have the disadvantage of being less 
explicit and more conventional than a beginner or student may require. 

Other types of analysis begin with a basis of the short vowels. The 
basic vowel phonemes of English may, for instance, then be considered 
as /1, €, Æ, 9, U, A, 9/, simplified in notation to /i, e, a, o, u, a, 3/. The 
remaining vowel sounds are then regarded either as sequences of these 
elements or as compounds of one of these elements + /j/ or /w/ (see 
§7.2). Thus:— 


[1:] is interpreted as /1 + 1/ or /ij/ 
[uz] i “ JU + u/ ” /uw/ 
[e1] fis a Je + 1/ ” /ej/ 


| See G. L. Trager and H. L. Smith, An Outline of English Structure, 1951. 
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[ai] 4 /a + 1ı/ ” /aj/ 
[91] as be Jo + if” /oj/ 
[ou] “ a /a + U/ ” /aw/ 
[au] $ ix Ja + U/ ” /aw/ 


and [1, e, uv] combine with /3/ to give /i3, es, ua/. There remains the 
difficulty of [a:, 3:1, 3:]. These may be treated symmetrically as glides to 
[a] (e.g. /ad, 03, 99/), or as sequences of /a +a, 0+0, 3+29/, or again, 
as /a+r, o+r, 9+r/. This last notation would repose, as far as RP is 
concerned, upon the complicated convention that, in a pronunciation 
which regularly uses an ‘intrusive’ [r] when [a:, 9:, ə] occur finally in a 
word followed by another word beginning with a vowel (e.g. Shah of 

. ., saw it, vanilla ice), the sequences /a+r, o+r, a+r/ are to be 
treated as [r]-less vowels before a pause or when a consonant 
follows.Thus, far /far/ = [fa:]; saw /sor/ = [so:]; cur /kar/ = [k3:]; 
palm /parm/ = [pa:m]; lawn /lorn/ = [lo:n]; colonel /karnl/ = 
[k3:nl]. Such words as carry, lorry, periphery /‘kari, ‘lori, pa rifori/ 
would be distinguished from starry, story, furry thus: /'starri, ‘storri, 
forri/. [a:, 9:, 3:] may also be analysed as /a, o, 9/ + /h/,! /h/ being 
realized here in vocalic form in complementary distribution with its 
more usual consonantal articulation. It will be seen that, although the 
basic phonemic units may be reduced in number to seven, an analysis in 
terms of composite units is complicated and may be remote from a 
speaker’s impression of reality and that the resultant notation will 
depend for interpretation on a great number of conventions. 


7.6 Conclusion 


The solution used in the following pages is one which does not seek 
ultimate economy of categories, gives a good deal of explicit informa- 
tion in the notation about the phonetic realization of the phonemes 
(especially the relation of quality and quantity), and takes some 
account of the RP speaker’s own feelings as to the distinctive vowel 
counters which he uses. We will, therefore, treat twenty vocalic pho- 
nemes, made up of the following vowels or vowel glides: — 


7 Short: /1, €, 22, D, U, A, 9/ 

> long (relatively pure): Us ai, 31, 3:7 
3 long (glides to [1]): /e1, al, 91/ 

2 long (glides to [u]): /au, au/ 

3 long (glides to [a]): /19, €3, Ua/ 


l See G. L. Trager and H. L. Smith, op. cit. 
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7.7 Description of the Vowels 


In the following detailed descriptions the RP vowel phonemes will be 
treated in a sequence based upon their quality relationships, viz.:— 


JAZ, 1503.2, A, At, DU OÍ 
/el, al, 3I, 3U, aU/ 
18, £9, 57 


The treatment of each vowel will include:— 

(1) Illustrations of spelling forms of variations of length of long 
vowels and diphthongs, of vowels followed by ‘dark’ [t], and compara- 
tive examples for practising variation of length or differentiation from 
neighbouring phonemes. 

(2) Articulatory description and an assessment of quality in relation 
to the Cardinal Vowels. (In all cases, unless otherwise stated, the soft 
palate will be assumed to be in its raised position, the vocal folds 
vibrating, and the tongue tip behind the lower teeth.) Remarks on 
distributional features in the word and syllable. 

(3) Indications of some of the chief variants—regional and social. 

(4) Remarks on the principal historical sources of the vowel. : 

(5) Difficulties encountered by foreign learners, with appropriate 
advice. 


7.8 Acoustic Features 


The following formant regions! (in cps) are representative of RP rela- 
tively pure vowel values (excluding /2/, whose quality varies greatly 
according to its situation and for which an average value may be taken 
to be equivalent to that of /3:/); in vowel synthesis, a combination of 
variable Fl and F2, with a fixed F3 in the region of 3,000 cps, gives good 
results. 


Fl F2 F3 
TA 280 2,620 3,380 
a 360 2,220 2,960 
Ter 600 2,060 2,840 
1&/ 800 1,760 2,500 
/A/ 760 1,320 2,500 
FOZ 740 1,180 2,640 
/0/ 560 920 2,560 
l 480 760 2,620 
/u/ 380 940 2,300 


'The formant figures given here were provided by Dr. J. C. Wells, University College, 
London, from an unpublished thesis on the formant structure of RP vowels. 
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NA 320 920 2,200 
13 560 1,480 2,520 


7.9 Short and Long (Relatively Pure) Vowels 
7.9.1 A7 


(1) Examples.—ee—tree, cheese, canteen 
e—complete, be, these 
ea—leaf, reason, sea 
¡e—piece, field, siege 
el, ey—selze, key, receive 
i—machine, police, prestige, suite 


(Note: /1:/ in ‘quay, people, Beauchamp’ /'bi:tfom/.) 


Long [i:]—see, seed, seen; fee, feed, fees 

Reduced [i]—seat, feet, piece, lease, beef, reach 

Compare [i:], [i]—bead, beat; seize, cease; leave, leaf; liege, leach; 
Eden, eaten 

[1] following—feel, meal, field, eels. 


(2) Description.—The front of the tongue is raised to a height 
slightly below and behind the close front position; the lips are spread; 
the tongue is tense, with the side rims making a firm contact with the 
upper molars. The quality is nearer to C[i] (with the glide mentioned 
below) than to C[e]. /1:/ does not normally occurina syllable closed by 
ll / 

(3) Variants.—The vowel is often noticeably diphthongized, espe- 
cially in final positions. A slight glide from a position near to [1] is 





Fic. 9.—/i:/ and variants. 
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common amongst RP speakers, being more usual than a pure vowel. 
Any glide having a starting point in the central area is, however, 
dialectal, that is, characteristic of a regional accent; a glide of the 
type [ii] is characteristic of the local pronunciation of Liverpool 
and Birmingham, whereas a lower central starting point may be 
heard in popular London English and many other dialects. The use 
of a pure vowel in a final position may be typical of an over- 
cultivated pronunciation; this is not the case, however, in Scottish 
English, where the vowel generally does not have the length charac- 
teristic of RP and is not, therefore, subject to the same tendency to 
diphthongization. 

(4) Chief Sources.—(a) Many of the words now said with /1:/ have 
had this vowel since at least 1500; these are frequently spelt with ee or ře. 
Such words are those which in OE! had a long vowel in the region of 
C [e] (cheese, sleeve); or had developed C[e:] from a diphthong [e:3] 
(deep, thief); or from a more open vowel near C [e:] (deed, needle); or 
from [o:] (geese, green), or from a lengthening of [1] in an originally 
open syllable (week) or [e] before [td] (field). In addition, a number of 
words from OF [ie] through AN [e:] (siege, niece, grief). 

Foreign words with [1:], adopted after the English [i:] vowel from the 
above sources had become established, fell in with the usual pronuncia- 
tion of ee in English and sometimes changed their spelling from / to ee 
(esteem, canteen; machine, routine). 

(b) Another group of words, frequently spelt now with ea, finally 
developed [i:] at about the end of the seventeenth century. Their chief 
origins are: an OE long vowel in the region of C[e:] (sea, leave, teach); 
an OE diphthong [e:9] (leaf, stream, east); an OE short [e] lengthened 
in originally open syllables (meant, eat); and, in addition, a large 
number of words from OF fal, ei] through AN [e:] (please, eager, 
season, peace). 

(5) Advice to foreign learners.—This vowel should give little diffi- 
culty to foreign learners, all of whom will have in their language a vowel 
of approximately the same quality. Their own vowel may not have the 
diphthongization which is typical of RP, but they should attempt to 
imitate this glide only with caution, since any exaggeration will sound 
vulgar or dialectal. More important is the reduction of length before 
fortis consonants, as exemplified above, since the differentiation 
between two words such as seize and cease is achieved more by the 
variation of the vowel length than by the quality of the final consonant. 
The reduced form of the vowel should, however, remain relatively tense 
and not be confused with [1]. 


l! For meaning of abbreviations see §6.2. 
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9.2 /1/ 


(1) Examples.—i—sit, fifth, with, rich 
y—city, rhythm, symbol 
e—pretty, needed, wicket, wicked, except, careless, 
houses 
¡e—ladies, cities 
a—village, private 


(Note: ‘build’ /bild/, “Sunday” /'sandı/ (and the other days of the 
week), ‘business’ /‘biznis/, ‘women’ /'wimin/, ‘minute’ (n.) /‘minit/, 
‘England’ /‘inglond/.) 


Compare {i:], /1/—feel, fill; seen, sin; bead, bid 
[i], /1/—least, list; reach, rich; sheep, ship; week, wick; 
feet, fit 
[iz], [i], /1/—seed, seat, sit; league, leak, lick; seized, 
ceased, cyst 
[tł] following—will, hill, milk, built, film, kiln 


(2) Description.—The short RP vowel /1/ is pronounced with a part 
of the tongue nearer to centre than to front raised just above the half- 
close position; the lips are loosely spread; the tongue is lax (compared 
with the tension for /1:/), with the side rims making a light contact with 
the upper molars. The quality is that of a centralized C[e] = [ë]. /1/ 
may occur in all positions in the word. 

(3) Variants.—The degree of closeness and centralization varies 
according to the accentual force falling upon the vowel and its position 
in the word (cf. the realizations of /1/ in the word visibility, those of 
syllables 1 and 3 being near to the sound described above, those of 
syllables 2 and 4 being somewhat more centralized, and that of the last 
syllable sometimes having a tongue position lower than half-close'). A 
trend towards /a/ in unaccented syllables, traditionally with /1/, is 
becoming increasingly noticeable among RP speakers of the middle and 
younger generations. 

(a) In some terminations, /3/ is now more common than /1/, e.g. 
-ity: /-ati/ rather than /-iti/ as in sincerity, quality, etc. 

-Itive: /-ativ/ rather than /-itiv/ as in positive, fugitive, etc. 

-ily: /-ali/ rather than /-il1/ (especially after /r/) as in merrily, pri- 
marily and also easily, happily, etc. 

-ate: often /-at/ rather than /-it/ as in fortunate, chocolate, etc. 
(Words such as magistrate, candidate sometimes have /-eit/; cer- 
tain words such as climate, private still show /-it/ as dominant.) 

-ible: /-abl/ (as for -able) rather than /-1bl/ as in possible, visible, etc. 


t Note that /1/ is not opposed to /e/ in final, open syllables, where such lowering is 
particularly apt to occur in conservative speech. 
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-em: /-am/ rather than /-1m/ or /-em/ (conservative RP) as in 
problem, system, item, etc. 

(b) In the case of other weak syllables, both /1/ and /3/ are heard 

from RP speakers, e.g. 

-ess: /-1S/ or /-3s/ as in useless, goodness (/-as/ being preferred by the 
younger generation); in cases where -ess is strongly felt as a 
feminine suffix and may be accented by some speakers, /-es/ 
may be used whether the syllable is strong or weak. 

-ace: /-18/ or /-as/ as in necklace, palace, preface, with an increasing 
tendency to /-as/. 

(c) In other cases, /1/ remains dominant, e.g. 

-age: predominantly /-1d3/ in manage, village, etc. (Recent French 
borrowings such as barrage, camouflage, etc., tend to have 
/-a:(d)3/.) 

-et: predominantly /-1t/ especially following /k, g, tf, d3/ as in pocket, 
target, hatchet, budget, etc. However, the endings -lef, -ret often 
have /-at/, as in bracelet, scarlet, claret, garret, etc. 

be-: /1/ is more common than /3/, as in begin, between, become 
(though in believe, belong, behave, etc., /a/ 1s often heard). 


(Note: Although se-, as in sedition, select, may sometimes have /sa-/ 
rather than the more usual /s1-/, de-, as in deposit, deny, desire, 
etc., is almost invariably with /di-/.) 

In the preceding cases, no significant oppositions between /1/ and 
/a3/ were involved. Where an opposition exists, it might be expected that 
there would be some pressure to retain the /1/ ~ /a/ distinction, as in 
the inflected forms offices /-1z/ v. officers /-az/ or chatted /-id/ v. 
chattered /-3d/. The neutralization of this opposition is typical of 
several non-RP forms of English, but the opposition is still strongly 
maintained by RP speakers. On the other hand, potential oppositions 
between /1/ and /3/ in such pairs as effect, affect and except, accept are 
commonly lost in favour of /9/. 

In final, unaccented positions, as in city, Mary, lady, etc., /1/ is 
increasingly replaced in the speech of the younger generations by a 
short variety of /1:/. In popular London speech, this final /1:/ will 
usually be realized as a glide [ai]. On the other hand, in most kinds of 
English, /1/ may replace /1:/ in the unaccented (weak) forms of such 
words as he, she, been, etc. 

Some RP speakers—especially those using the type of speech which 
has the [é] variety in unaccented syllables—will diphthongize /1/ 
towards [ə] particularly in accented monosyllables, e.g. in big, did, 
thin, wish, etc. 

(4) Chief sources.—As in the case with many of the PresE short 
vowels, /1/ has shown considerable stability since OE. The OE forms of 
such words as ship, quick, give, drink, this, smith, etc., probably had 
vowels similar in quality to that of RP /1/. PresE /1/ also derives from 
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closer 





FIG. 10.—/1/ and variants. 


OE [i:] (wisdom, bliss); from OE [y] (king, kiss, bridge); from OF [i] 
(rich, simple, mirror, prison). In addition, OE or OF [e] + nasal con- 
sonant has often given PresE /1/ (England, string, ink, chimney). 

In unaccented syllables many new cases of /1/ have arisen, notably 
from [e] or [3] in such suffixes as -es, -ed, -ness, -less, -est (horses, 
waited, kindness, hopeless, biggest); from formerly accented syllables 
with [e] or [a:] (hostess, prophet, village, Highgate, orange); in prefixes 
and medial vowels spelt with e (describe, enquire, despite, before, 
declare, expect, elegant, benefit); from earlier diphthongs (forfeit, 
sovereign, fountain, journey); and many cases of PresE final /1/ from 
earlier [1: > 21] with a secondary accent (charity, majesty, memory, 
heavenly). 

(5) Advice to foreign learners.—lIt is of the utmost importance that a 
proper qualitative relationship should be maintained between /1:/ and 
/1/. Many languages have a short variety of [i], e.g. French, Italian, 
etc., but one which is likely to be too tense and close for the English /1/; 
others, e.g. Polish or Russian, have a centralized type of [1] which has 
too much of an [ə] quality for English; others, e.g. German, have a type 
of [1] near to the English variety but yet often too tense. Speakers of 
those languages which possess a vowel of the Cle] type (which is 
approximately on the same level as the English /1/) should modify this 
sound in the direction of [9]; alternatively, a [y] sound, asin French but, 
said with relaxed spread lips, will come near to the English /1/ as in bit. 

Of equal importance is the quantitative relationship of /1:/ and /1/. 
Once the correct quality of /1/ has been acquired, most learners can dis- 
tinguish bead [bi:d] from bid [bid], where the distinction is a complex 
of quality and quantity. But an opposition of the sort beat [bit] —bit 
/bit/, where the difference of vowel length is insignificant, is more 
difficult. Three types of vowel should, therefore, be practised: 
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close, tense, long [1:] (bead); close, tense, short [1] (beat); and the half- 
close, lax, short [1] (bit, bid), as in the comparative examples given in (1) 
above. 

The fact that /1/ occurs very frequently in unaccented syllables 
should also be noted, since an unreduced vowel in the weak syllables of 
such words as village, waited, fountain, describe, may seriously deform 
the accentual pattern for the native listener. 


7.9.3 /e/ 


(1) Examples.—e—set, bed, went 
ea—dead, head, breath 
a—many, Thames 


(Note: the following words with /e/—‘says, said, bury, Geoffrey’ 
/‘dzefri/, ‘Leicester’ /lesta/, ‘friend, ate, again’ /a'gen/ or /a'gein/.) 


Compare. —/1/, /e/—sit, set; tin, ten; will, well; disk, desk 
/i:/, /1/, /e/—neat, knit, net; reach, rich, wretch; reed, 
rid, red; feel, fill, fell 
[t] following—well, sell, else, health, elm, held 


(2) Description.—For the short RP /e/, the front of the tongue is 
raised between the half-open and half-close positions; the lips are 
loosely spread and are slightly wider apart than for /1/; the tongue may 
have more tension than in the case of /1/, the side rims making a light 
contact with the upper molars. The quality lies between that of C[e] and 
that of C[e] = [e] or [e]. /e/ does not occur in final, open syllables. 

(3) Variants.—The general RP variety of /e/ tends to be closer to 
C[e] rather than to C[e]. This may be due to the fact that /æ/ in this 
type of pronunciation is only a very little more open than half-open; /e/ 





FIG. 11.—/e/ and variants. 
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is, therefore, considerably closed in order to maintain the qualitative 
distinction. An /e/ which is very near to C [e] is typical of over-refined 
RP and may often be associated with the closer type of /z/. If /e/ has a 
quality nearer to the half-open type, as in some kinds of RP and many 
regional dialects, /22/ in turn is more open. 

An advanced RP form of /e/ is diphthongized in the direction of [a], 
e.g. men, said, get [me’n, sed, ge*t]. Such diphthongization is often 
characterized as ‘affected’. Another diphthongal glide, in this case in 
the direction of [1], is heard in popular London‘speech, particularly in 
monosyllables closed by a lenis consonant, e.g. bed, leg [be'd, le'g]. 

(4) Chief sources. —The majority of PresE /e/ forms can be traced 
to: OE [e] (bed, neck, edge, best); shortened OE [e:] (fed, met); OE 
[e:3] (theft, friend), OE [e ] (let, ever, flesh); eModE shortening of 
ME [e:] (bread, death, deaf); OE [y] (bury, merry)—the Kentish [e] 
form having been adopted; OF [e] (debt, press, accept, second, 
member). Some modern /e/ forms previously vacillated between /e/ 
and /æ/ (then, when, any, many); others derive from a former [ei] 
(Leicester, said, says). 

(5) Advice to foreign learners.—This vowel may present difficulties 
to those foreign learners whose native language possesses two types ofe 
usually of C [e] and C [e] qualities. Very often such a learner equates the 
English /e/ with his own half-open variety, thereby using a vowel of too 
open a quality which might be confused by RP listeners with /æ/. He 
should therefore modify his vowel in the direction of his own, closer, 
C [e] sound. 


7.9.4 ea 


(1) Examples.—a—sat, hand, lamp, rash, marry 
ai—plait, plaid 
Compare.—/e/, /2/—pet, pat; peck, pack; said, sad; ten, tan; 
lend, land; merry, marry 
/1/, /e/, /2/—bid, bed, bad; big, bed, bag; tin, ten, 
tan; miss, mess, mass 
[ee: ] (before /b, d, 9) d3/) and [ee] —cab, cap; bad, bat; 
bag, back; badge, batch 
[+] following —alphabet, shall (accented form), Pall Mall, balcony, 
scalp 


(2) Description. —The mouth is slightly more open than for /e/; the 
front of the tongue is raised just below the half-open position, with the 
side rims making a very slight contact with the back upper molars; the 
lips are neutrally open. In the south of England /z/ is often produced 
with considerable constriction in the pharynx, the tongue itself having 
rather more tension than isthe case for /e/. The quality is nearer to C [e] 
than to C[a] = [e]. 
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Fic. 12.—/z/ and variants. 


This traditionally short vowel appears to be lengthened in RP espe- 
cially before the lenis consonants /b, d, g, d3, m, n/ (cab, bad, bag, 
badge, jam, man). Though vowels are regularly longer before syllable- 
final lenis consonants than before fortis consonants, the lengthened [æ] 
is equivalent in quantity to the longest varieties of /i:, az, 9:, 3:/. In 
terms of the system, this may be due to the increasing qualitative pro- 
ximity in RP of /e/ and /&/, the extra length serving as an additional 
distinctive feature; the qualitative-quantitative relationship of /z/-/e/ 
tends, therefore, to become of the same type as /1:/-/1/, 1.e. 
(ze: ]-[ze]-[e] (bad, bat, bet), cf. [i:]-[i]-[1]. /Æ/ does not occur in final, 
open syllables. 

(3) Variants.—/z/, the most open of the RP true front vowels, is 
closer than fully open. This compression of the front phonemes is a 
characteristic of RP, but regional dialects often show a greater qualita- 
tive separation of these phonemes. Thus, where RP /e/ and /x/ have 
the values described, other types of English will have values C[e] and 
C [a], or [a] or even [a]. (A more relaxed /ze/—in the region of C[a]—is 
also heard amongst children in the south of England who otherwise 
have an RP system and who, later in life, adopt the tenser and closer 
variety of /æ/. Such a lowered /æ/ is maintained by many young 
women, although /A/ continues to be realized as the low front variety 
described in §7.9.5. The result can be a confusion of /z/ and /A/, the 
meaning being resolved by the context.) 

On the other hand, that type of refined RP (and also popular 
London) which realizes /e/ in the C[e] region also raises /z/ to 
approximately C[e]. In this case, the /e/-/z/ opposition is reinforced 
either by the lengthening of /z/ already mentioned, or by a diph- 
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thongization of /z/ towards [a], e.g. bad, cat [beerd, kz*t] (cf. similar 
diphthongization of /1/ and /e/). 

(4) Chief sources.—PresE /æ/ derives regularly from: OE [a] (man, 
cat); OE [æ] (sad, back, apple); OE [e:3] (shadow, shank); shortened 
OE [a:] (hallow) or OE [e:] (ladder, mad)—this shortening being 
earlier than that of the death type where ME [e:] > eModE [e]; Scan- 
dinavian [a] (flat, anger). Also regularly from OF [a] (lamp, manner, 
passage) and occasionally from OF [au] (sa/mon, savage). Most earlier 
sequences of the type [wa-] have given [wb-] or [wo:-] (watch, quality, 
water); /z/ is, however, retained in some cases, especially when a velar 
consonant follows (wag, wax, twang). 

(5) Advice to foreign learners.—The main difficulty for all those 
whose own languages have a less complex vowel system lies in the estab- 
lishment of the qualitative oppositions /1/-/e/-/z/, while at the same 
time using a type of /z/ which is not too open. Foreign learners often 
find it helpful to make a conscious constriction of the pharynx for /2/. 
The opposition /e/-/z/ may be emphasized by making use of the 
length component now a feature of RP /2/ in certain contexts, e.g. in 
men, man; bed, bad. Where length may not be so distinctive, e.g. in net, 
gnat, the quality separation should not be such that /æ/ comes near to 
C[a]; if this does occur, there is the danger, in the south of England, of 
confusion with the /a/ of nut. 


7.9.5 /A/ 


(1) Examples. —u—sun, cut, dull 
o—son, come, among, one, done, month, colour, 
monkey, mother, nothing, Monday, onion, 
London, oven 
ou—country, southern, couple, enough, young 
oo—blood, flood 
oe—does 


(Note: many earlier u spellings have been changed to o, especially in 
the vicinity of u, m, n, w, v, e.g. in ‘love, some, won’, etc.) 


Compare /x/, /A/—cat, cut; lamp, lump; match, much 
Ja: /, /A/—cart, cut; barn, bun; march, much 
/o0/, /A/—cot, cut; fond, fund; wander, wonder 
/3:/,/A/—curt, cut; fern, fun; turf, tough 

[1] following—dull, result, pulse, bulge, bulb 


(2) Description.—The short RP /a/ is articulated with a con- 
siderable separation of the jaws and with the lips neutrally open; the 
centre of the tongue (or a part slightly in advance of centre) is raised just 
above the fully open position, no contact being made between the 
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/ ail i 
a TA = & 


Fic. 13.—/a/ and variants. 


tongue and the upper molars. The quality is that of a centralized and 
slightly raised C[a] = [&]. /a/ does not occur in final, open syllables. 

(3) Variants.—The variety of /a/ described above is that of general 
RP as used by younger people, especially in the London region. Con- 
servative RP speakers will often use a more retracted vowel, i.e. an 
unrounded and centralized type of C[9]. Regional speech of the 
London area has for /a/ an open front vowel very close to C[a]. In 
northern regional speech, a half-close back vowel is used, with or 
without lip-rounding, i.e. [u] or [Y]. In the same type of English, some 
words, spelt with o and with /a/ in RP, may have /D/, e.g. one, among, 
nothing. In RP both /a/ and /D/ may be heard (for an earlier /A/) espe- 
cially in words where the spelling form o is followed by a nasal con- 
sonant in an accented syllable, e.g. Montgomery, Bromley, accom- 
plish, combat, Tonbridge, comrade, conduit, constable. 

(4) Chief Sources.—PresE /a/ is the regular development of the ME 
fronted back, half-close lip-rounded [u]. This in turn derives from OE 
and OF sources: OE [u] (sun, love, nut, ugly); shortened OE [u:] (us, 
husband, utter, enough, scum), OE [y] (blush, much, such); OF [u] 
(cousin, touch, dozen, colour, cover) and OF [u] or [o] before a nasal 
consonant (number, sum, front, uncle, comfort, money); OF [y] (just, 
Judge, public, study). The ME [u] sound has, in those words where we 
now have /a/, i.e. [4], been lowered and has lost its lip-rounding. The 
unrounded stage appears to have been reached by or during the seven- 
teenth century in the London region, though at that time the lowering 
may not have been very considerable, i.e. a centralized variety of C[¥]. 


This maybe transcribed as [e]. See I.P.A. chart on pp. 340-1. 


The English Vowels 111 


A half-open stage (probably somewhat fronted from true back) may be 
postulated for the eighteenth century, with the quality C[A] typical of 
the nineteenth century, the tongue position being by the end of the 
century rather below half-open. Thus, PresE /a/ is the result of qualita- 
tive changes greater than those which have affected any other short 
vowel. Further progress into the front region is likely to be inhibited by 
the presence of the /æ/-/a:/ opposition, unless the lengthening of /z/ 
becomes general, in which case /A/ may fulfil the function of a short 
open front vowel opposed to both /æ:/ and /a:/. 

It is to be noted that some PresE /a/ forms have developed from a 
ME [o:] (flood, blood, done, month, glove, mother). ME [o:] must 
have shortened in time for it to develop, as ME [u] ([o] being similar in 
quality to [u]), to PresE /a/. The words one, none, should regularly 
give PresE */aun, noun/, this being the normal development of ME 
[9:]; this regular sound change is attested into the seventeenth century 
and is retained in the PresE form of the compound alone. The modern 
RP form with unrounded /a/ derives from a shortened, raised ME 
vowel, preceded in the case of one by a glide [w], originally considered a 
vulgarism; the vowel of none results from an analogy with that of one. 

(5) Advice to foreign learners.—Most languages possess a vowel of 
the [a] or [4] type. The English /a/ should be related to this quality and, 
indeed, good results can often be obtained in teaching by transcribing 
the English vowel as /a/ rather than with the traditional /A/ symbol. In 
this way prejudice induced by the frequent orthographic spelling with u 
or o is avoided; if the quality thus obtained is too fronted, it may be 
modified in the direction of /a:/. A proper qualitative distinction 
should be maintained between the vowels in such words as match, 
much, march; ban, bun, barn; hat, hut, heart. Above all, there should 
be no lip-rounding such as might produce a type of front open-rounded 
[œ]. 


7.9.6 1a: 
(1) Examples.—a—pass, after, bath, tomato, father, branch, 
camouflage 


ar—part, car, march 
ear—heart, hearth 
er—clerk, Derby, sergeant 
al—calm, palm, half 
au—aunt, laugh 


(Note: /a:/ in ‘vase’, and in recent borrowings from French in which 
the French -oir [wa:r] is realized in English as /wa:/, e.g. ‘reservoir’.) 


Long—bar, far, farm, large, hard 
Reduced—part, last, raft, lark, arch 
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Fic. 14.—/a:/ and variants. 


Compare [a:], [a] —card, cart; parse, pass; carve, calf; large, larch 
/a:/, /A/—cart, cut; harm, hum; march, much; lark, 
luck; dance, dunce 
[t] following—snarl, gnarled, Charles 


(2) Description.—This normally long vowel is articulated with a 
considerable separation of the jaws and the lips neutrally open; a part 
of the tongue between the centre and back is in the fully open position, 
no contact being made between the rims of the tongue and the upper 
molars. The quality is somewhat nearer to C[a] than to C[a]. Although 
there is a difference of length according to whether it occurs in a syllable 
closed by a fortis or lenis consonant, the shortening effect of a closing 
fortis consonant 1s not as marked as for other long vowels; thus, 
whereas the reduced [i] of beat may be of similar length to the /1/ of Dit 
the reduced [a] of cart is somewhat longer than the short /A/ of cut. 
/a:/ does not normally occur before /n/. 

(3) Variants.—A variety of /a: / retracted near to the quality of C[a] 
is typical of some advanced (refined) RP speakers; a variety of /a:/ 
fronted towards C[a] is also heard among some RP speakers and in 
many regional types of English, e.g. in Australian English. 

Many regional forms of English do not make the RP distinction 
between /z/ and /a:/, using for both a vowel in the region of C[a]. 
These forms of English not having developed historically in the same 
way as RP (see following section), two main word groups show a 
variance from RP system: firstly, those cases of /a: / in RP ina syllable 
closed by /f, 8, s/ or by /m, n/ + consonant (craft, laugh, bath, pass, 
sample, branch); secondly, those words, originally with vowel + /r/, 
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now with /a:/ in RP (cart, hard, large, heart, sergeant). In the first 
group a short front vowel is typical of many regional types of English, 
though some dialects will use a lengthened front vowel; in the second 
group, either [a] or [ee] + [r] is used or a lengthened [a:] or [ze:]. Thus 
the distinction cat-cart may be [kat]-[kart] or [kat]-[ka:t], etc. 

In RP, too, there are many cases of indecision between /z/ and /a:/ 
in words where the vowel is followed by /f, 8, ð, s/ or by nasal con- 
sonant + consonant. Thus, /ather, transfer, elastic, plastic, are words 
in which /z/ or /a:/ may be heard. Sometimes the use of /a:/ as 
against the more usual general RP /2/, e.g. in gymnastic, Atlantic, 
denotes either a special, conservative RP or a form of regional speech 
over-normalized towards RP. One of the main vocalic features which, 
in the popular mind, distinguishes RP from much regional speech is the 
presence in the former of the /z/-/a:/ opposition, especially in the 
category of words containing vowel + /f, 8, s/ or nasal consonant + 
consonant. There are, however, in RP many cases where /z/ rather 
than /a:/ occurs in these phonetic contexts: e.g. /a:/ in pass, glass, 
can’t, grant, chance, dance, demand, slander, caster, aghast, but /z/ 
in passage, ass, cant, rant, finance, romance, expand, random, aster, 
gas. 

(4) Chief sources.—Of the many sources of RP /a:/, the following 
are the most important: (a) through loss of post-vocalic /r/ in the eigh- 
teenth century, short [a] or [æ] > /a:/ (charm, march); the [a] or [ee] in 
question may often result from ME [e] (far, star, heart) as well as from 
an earlier French [e] (farm, clerk, sergeant); (b) lengthening of [a] or 
[æ] > /a:/, due to the following fricative (especially /f, 8, s/), inci- 
pient at the end of the seventeenth century (staff, after; path, bath; 
pass, ask, cast; father, rather); (c) reduction of OF [a] > ME [au] > 
[0:] > /a:/ (aunt, branch, command, chant); (d) reduction of ME 
[au] and late ME loss of [t] > /a:/ (half, calf, palm, balm); (e) appro- 
ximations of foreign values in more recent borrowings—mainly from 
French and Italian (charade, moustache, memoir, sonata, tomato, 
drama, saga). 

It should also be noted that a new /a:/ is appearing in PresE, result- 
ing from the levelling of the sequences [aia] and [aua] (fire, tower)—see 
eae 

(5) Advice to foreign learners.—Many languages do not have a qua- 
litative opposition, in the relatively open region, of the English 
/z/-/a:/ type. The retracted nature of RP /a:/ should be insisted 
upon, especially in those words of the after, path, pass, chance cate- 
gories. This retraction may be achieved by modifying /a:/ in the 
direction of /v/ (see §7.9.7) and comparing such pairs as cart, cot; calf, 
cough; lark, lock; large, lodge. 

In addition, in the case of words in which /a: / is shown in the spelling 
by vowel letter + r, the temptation to pronounce any kind of [r] 
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should be overcome (except when word final r may link to a following 
word beginning with a vowel). It is helpful to consider such post-vocalic 
r letters simply as a mark of length for the preceding vowel. French 
learners should be careful not to use undue nasalization in words of 
French origin which suggest modern French forms, e.g. branch, plant, 
etc. 


7.9.7 /D/ 


(1) Examples.—o—dock, dog, holiday, sorry, gone 
a—was, what, swan, want, watch, quality 
ou, ow—cough, trough, Gloucester, knowledge 
au—because, sausage, laurel, Austria, Australia, 
cauliflower 


(Note: /o/ in ‘yacht’ /jot/.) 


Compare /v/, /a:/—lodge, large; cot, cart; cough, calf; impossible, 
impassable 
/0/, /o:/—cod, cord; don, dawn; stock, stalk 
[H following—doll, involve, revolver, solve 


a 
FIG..15)—7 07 and varianis: 


(2) Description.—This short vowel is articulated with wide open 
jaws and slight, open lip-rounding; the back of the tongue 1s in the fully 
open position, no contact being made between the tongue and the upper 
molars. The quality is that of an open lip-rounded C [a], i.e. secondary 
C[p], /0/ does not occur in a final, open syllable. 

(3) Variants.—The type of /D/ described above has a very slight 
degree of lip-rounding. Some varieties of /p/ (notably those of south- 
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west England and American English) have no lip-rounding and a 
tongue raising often somewhat advanced from true back. There is, 
therefore, considerable qualitative similarity between this kind of /p/ 
and RP /a:/. The phonemes are kept distinct either through a complex 
of length and quality, e.g. cough, calf being distinguished by the length 
and fronted nature of /a:/, or through the pronunciation of post- 
vocalic r (vowel + r being the most common spelling of RP /a:/), e.g. 
in dock, dark or lodge, large. 

Many words containing /0/ + /f, 8, s/ have an alternative pro- 
nunciation with /9:/, e.g. off, cloth, cross. Such a variant is typical of 
conservative RP and had a social prestige value in southern England, 
but is generally now replaced by /D/. This shift away from the tradi- 
tional /3: / in such contexts may be due to the fact that /o:/ is also 
typical of popular London speech (Cockney), which uses /2:/ in these 
situations and also in such words as dog, gone, etc. There is some varia- 
tion, too, in certain words spelt with au or a, both /p/ and /2:/ being 
heard, e.g. in Australia, Austria, fault, salt—the first two being more 
commonly with /D/, the last two more commonly with /2:/. 

It is noticeable that in sequences /vl/ + /C/ the pronunciation /23ul/ 
+ /C/ is becoming more common, especially when /v/ is the final con- 
sonant, e.g. ininvolve, evolve, etc. This is partly due to pressure exerted 
by the fact that /aul/ sequences (roll, bowl, soul, mole, etc.) are con- 
siderably more frequent than /pvl/ (doll, loll, col), and that /vl/ + /C/ 
sequences are themselves rare, e.g. lolled, dolls, golf, solve; salt, fault 
are more often said with /o:/ than with /v/, volt more commonly with 
/əu/ than with /v/. In addition, the replacement of traditional /p/ by 
/30/ before /IC/ may be due in the London region to an analogous 
hypercorrection of the popular allophone [pu] for /au/ before [ft] (see 
§7.10.4). 

In Scottish English the /p/-/o:/ distinction is often not made, cot 
and caught both having a vowel in the region of C [9] and cord being 
kept separate from cod by the additional distinction provided by the 
pronunciation of post-vocalic r. 

(4) Chief sources.—PresE /D/ derives regularly from OE [o] (dog, 
cock, song, long) and OF [9] (offer, lodge, jolly), and occasionally from 
shortened OE [o:] (blossom, soft). It is likely that the short [9] of OE 
and ME was somewhat closer than PresE [o], the more open type of 
articulation having been reached at the end of the eModE period. 
Between the sixteenth and eighteenth centuries, many words, having [9] 
in ME, were pronounced with [a] or [æ], e.g. in God, strop, stop, plot. 
This pronunciation, typical during those centuries of both fashionable 
and vulgar London speech, is now retained only in the forms strap and 
Gad (exclamation). 

ME short [a] preceded by [w] is often rounded to [o] (what, watch, 
was, want, quality), a change which appears to have become established 
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during the seventeenth century. The rounding has not taken place when 
a velar consonant follows, e.g. in wag, wax, swagger, twang, nor, in 
some pronunciations, when /f/ follows, e.g. quaff, waft (/kwa:f, 
wa:ft/), nor in the word swam. 

Shortening of ME [9:] has taken place in such words as gone, shone 
(though the regular development with [o:] was still used in the seven- 
teenth century); monophthongization and shortening has affected the 
ME forms of knowledge, sausage, etc. The lengthened form of ME [>] 
before /f, 8, s/ appears in the second half of the seventeenth century 
and has existed beside the short /p/ ever since, though it seems to be 
losing ground to-day. /b/ + nasal consonant occasionally represents 
[a] in recent French borrowings, e.g. restaurant, fiancé /'restront, 
fi pnse1/. 

(5) Advice to foreign learners.—Short back open vowels occurring 
in other languages often differ from the English /v/ in that either they 
are somewhat closer or more centralized, or are pronounced with 
stronger lip-rounding. The extremely open nature of the English vowel 
can be emphasized by relating it to /a:/. Words such as part, large, 
calf, should be said as far back in the mouth as possible, with the jaws 
wide apart and only the very slightest lip-rounding. In this way, a 
quality near to the /v/ of pot, lodge, cough, may be obtained. 


7.9.8 FOV 


(1) Examples.—or—cord, horse, sword, born 
aw—saw, lawn, jaw, yawn 
ou, au—bought, ought, daughter, fault, cause 
a—all, talk, salt, water, war, quart 
ore, oor, oar, our—before, more, door, floor, oar, 
board, court, four 


(Note: /3:/ in ‘broad, sure’—or with /ua/ in the second word.) 


Long [9:]—saw, war, born, board, dawn 

Reduced [9] —sort, ought, horse, chalk, quart 

Compare [5:], [9] —saw, sort; war, wart; board, bought; saws, sauce 
/0/, /9:/—cod, cord; don, dawn; stock, stork 
/U/, /d:/—put, port; could, cord; bull, ball 
/əU/, /9:/—code, cord; cold, called; bone, born 

[t] following—all, ball, bald, walled, halt, false 


(2) Description.—This relatively long RP vowel is articulated with 
medium lip-rounding; the back of the tongue is raised between the half- 
open and half-close positions, no contact being made between the 
tongue and the upper molars. The quality lies between C [o] and C[o], 
1.e. [9] or [9]. /9:/ does not normally occur before /n/. 
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FIG. 16.—/2:/ and variants. 


(3) Variants.—The traditional relationship between /2:/ and /v/, 
i.e. one in which the opposition is mainly a matter of quantity (as in 
offal—awful, cot—caught), appears to be lost in PresE general RP. 
The quality of /9:/ is now closer, so that, in advanced RP, it often 
approaches C[o]. Thus, if the word cord were to be said in this style of 
speech with an unusually short /9:/, it would resemble could. More- 
over, /o: / increasingly replaces earlier [9a] forms in the words spelt with 
ore, oor, oar, our (before, door, soar, four), though [9a] is retained 
both in conservative RP and in many regional dialects. Alter- 
natively, those words containing r may have in regional types of speech 
[9] or [o] + /r/. Even when the pronunciation of r has been reduced to 
[a], this distinction of the preceding vowel may persist, e.g. horse /hors/ 
or /həəs/, hoarse, /hors/ or /hoas/; born /born/ or /boen/, borne 
/born/ or /boan/. In RP some [əə] forms < /ua/ have a form with 
/9:/, e.g. sure, your (also often you’re), poor. 

In popular London speech /9:/ is often realized in open syllables as 
[9: wa], both in cases deriving from original vowel + /r/ (door, four) 
and also in those where no [r] has previously existed (saw, law); a 
diphthong of the type [pu] or [ou] is often used before a consonant, not 
only before [t] (fall, ball) but also in such words as caught, daughter, 
born, horse, talk, board, etc. 

(4) Chief sources.—The tendency of PresE RP /9:/ to become closer 
is a continuation of a historical closing process. Many instances of 
modern /9:/ result from a ME diphthong [au], which is likely to have 
had a variant [a:]. This diphthong or open vowel derives in turn from: 
OE open vowel + velar or labial consonant sequences, e.g. [a] + [w] 
(thaw), [a] + [y] (aw), [e:] + [x] (taught), [a] + [v] (hawk < hafoc); 
an earlier OE [a] with [vu] glide before [t] (al!l, call, fall) and with later 
(approximately seventeenth-century) loss of [t] (talk, walk); OF 
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[4] or OF [a] + [u] or [o] sequences (cause, autumn, sauce, haunt, 
lawn) or sometimes as a reduction of [a] + [v] (saunter, laundry). ME 
diphthong [du] is also often realized in PresE /o:/; this ME form 
derives from OE [o] + [x] (bought, wrought), from OE [o:] + [x] 
(thought, brought, daughter) and from OE [a:] + [x] (ought). 

In addition, in the same way that ME [a] preceded by [w] generally 
gives modern /b/, [w] has also rounded ME [a] or [a:] to /o:/ during 
the seventeenth century (water, quart, warn, warm). Finally, a large 
number of instances of RP /o:/ result from the loss of post-vocalic /r/ 
in the eighteenth century (horse, lord, source, forth) via such stages as 
[əə] or [09]. This change has levelled several ME vowel distinctions, e.g. 
ME [o] + /r/ (short, horse), ME [9:],+ /r/ (force, board), and ME 
[o:] or [u:] + /r/ (sword, fourth, floor, mourn, pour); such lowering 
and monophthongization of earlier [us] is also extended by some 
speakers to such words as sure, poor. Vowel distinctions preceding 
retained /r/ or its replacement are still kept in regional dialects (see (3) 
above). 

(5) Advice to foreign learners.—In many countries a type of /9:/ is 
taught which is rather more open than the general RP variety described 
above and which can no longer be said to be typical of RP. The slightly 
higher tongue position should be accompanied by closer lip-rounding. 
Many languages have a vowel in the region of C[o]; this latter sound 
may serve as a Starting point for acquiring RP /9:/, the tongue and lip 
positions being relaxed until the correct quality is reached. 

The spelling forms of /9:/ often cause difficulty. No r should be pro- 
nounced where it occurs in the spelling of such words as port, sort, lord, 
more, except when, in a word final position, it is used as a link witha 
following word beginning with a vowel, e.g. pour out /'po:r 'aut/. 
Words having /o:/ and spelt with au, aw, ou, e.g. taught, saw, ought, 
are often wrongly given a [du] or [ou] type of diphthong. The mono- 
phthongal nature of /9:/ should be insisted upon, especial care being 
taken to keep a proper distinction between /9:/ and /əu/ in such pairs 
as caught, coat; saw, So. 


Toe lu/ 


(1) Examples. —u—put, full, sugar, cushion, butcher 
o—wolf, woman, bosom 
oo—good, book, wood, wool 
ou—could, should, would, courier 


(Note: ‘Worcester’ /'wusto/, ‘worsted’ (cloth) /'wustid/.) 


Compare /u/, /u:/—full, fool; wood, wooed 
/0/, /o:/—could, cord; wood, ward 
[t] following—full, pull, wool, wolf 
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(2) Description.—The short RP vowel /u/ is pronounced with a part 
of the tongue nearer to centre than to back raised just above the half- 
close position; it has, therefore, a symmetrical back relationship with 
the front vowel /1/; the tongue is laxly held (compared with the tenser 
/u:/), no firm contact being made between the tongue and the upper 
molars. The lips are closely but loosely rounded. The quality is that ofa 
centralized C[o] = [6]. This vowel occurs in both accented and 
unaccented syllables, being present in the accented syllable of a rela- 
tively small number of words, though some of these are of common 
occurrence, e.g. put, good, look, would, etc. /U/ does not occur in 
word initial positions nor before final /n/ and finally only in the 
unaccented form of fo /tu/. 

(3) Variants.—Little striking variety is found in RP realizations of 
/U/. Some speakers use less lip-rounding and a lower tongue position 
than that described above, notably in the common word good [g6d] or 
[g¥d], and also in should, could and, to a lesser extent, would. (The 
unaccented forms of these last three words very often have /9/ rather 
than /U/.) In some words there is a variation between /u/ and /u:/, e.g. 
room, groom, broom, tooth, the commoner phoneme being /u:/. A 
form of [Y] represents /U/ in some northern regional speech, e.g. 
butcher ['b¥tfa]; and again, in northern speech, many words spelt with 
oo have /u:/, e.g. cookery book. In Scotland the opposition /U/-/u:/ 
may be neutralized, a centralized [ti] being used for both, so that pull 
and pool have a similar vowel quality. 

(4) Chief sources.—As we have seen ME short [u] has a regular 
development to PresE /a/. A number of cases of ME [u], however, 
whether from OE sources (full, bull, wolf, wool, wood) or from OF 
sources (push, butcher, pulley), have retained their [u] quality; the 
presence of a preceding labial consonant may be said to account for the 
retention of a lip-rounded vowel, but there are several cases where /A/ 
has developed despite a labial consonant (butter, bud, pulse). In 
another group of words, PresE /u/ derives from a ME [o:] which regu- 
larly gives PresE /u:/ (food, moon); such words are good, foot, stood, 
book, look, for which alternative pronunciations with /u:/ or /U/ 
existed into the seventeenth century. The northern pronunciation with 
/u:/ in many such words (mentioned above) retains the closer form, 
abandoned in the London region. 

(5) Advice to foreign learners.—The difficulty of /U/ 1s similar to 
that of /1/, i.e. just as the vowels /1:/, /1/, presented three oppositions 
involving complexes of quality and quantity, [i:]-fi]-[1], so /u/ has to be 
distinguished from /u:/ sometimes by quality alone ( foot-boot), some- 
times by quality and quantity (good-food). The quality of /u/ must be 
kept quite distinct from that of the reduced form of /u:/; if a vowel of 
the quality of C[o] occurs in the learner’s own language, this may be 
used as a starting point for learning English /u/—essentially a 
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FIG. 17.—/vu/ and variants. 


centralized [o]. Thus, in the case of French learners, for instance, the 
vowel in foot may be usefully related to the French vowel in faute and 
the English vowel acquired by relaxing the whole articulation. Relating 
/u/ and C[o] in this way underlines the fact that /u/ is not a kind of [u] 
sound. If the centralization of /u/ is not sufficient, the starting point 
may be a central [ə] modified in the direction of [o]. The opposition 
between /u/ and fully long /u:/ is less difficult once the distinction 
/u/-reduced /u: / is established (see following section for comparative 
exercises). 


7.9.10 AN bey 


(1) Examples.—oo—food, soon, moon, spoon 
o—do, who, move, lose 
ou—group, soup, wound (n.), through 
u—rude, June, Susan 
ew, ue, ul, oe—chew, blue, juice, shoe 


(Note: in many cases of the spelling u, eu, ew, ue, ui, /u:/ is preceded 
by /j/, e.g. ‘music, duke, neuter, new, few, hue, argue, nuisance, 
beauty’; in some words, both /u:/ and /ju:/ are heard, e.g. ‘suit, 
enthusiasm’.) 


Long {u: ]—two, blue, food, move 
Reduced [u] —boot, fruit, hoof, group, douche, hoop 
Compare {u:], [uJ—shoe, shoot; rude, root; lose, loose; use (v.), 
use (n.); nude, newt; Jews, juice 
[u:], /u/—food, good; pool, pull 
[u], /u/—boot, foot; loop, look 
[t] following—cool, rule, schools, fooled 


(2) Description.—RP long /u:/ is a back close vowel, but the tongue 
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Fic. 18.—/u:/ and variants. 


raising is relaxed from the closest position and is somewhat advanced 
from true back; its relationship with /u/ is similar to that between /i:/ 
and /1/, the articulation of /u:/ being tense compared with that of /U/, 
though no firm contact is made between the tongue and the upper 
molars. The lips tend to be closely rounded. The quality is that of a 
relaxed, slightly lowered and centralized C[u]. /u: / does not normally 
occur before /1/. 

(3) Variants. —The absence in English of any opposition between 
/u:/ and a vowel of the front, close rounded type, [y], is an important 
reason for the relaxation and fronting of this phoneme from a true back 
position. Considerable centralization, [ü], appears increasingly to be a 
feature of certain types of RP, especially in the London region. In this 
latter area, centralization is a characteristic of the regional dialect, both 
popular and modified. Extreme centralization amongst RP speakers is, 
therefore, inhibited to a certain extent for social reasons. In those cases 
where /u:/ is preceded by /j/ (new, beauty), the palatal nature of /j/ 
normally entails considerable centralization of /u:/. A type of [ti] 1s 
also to be heard in many forms of Scottish English as a realization of 
RP /u:/ (and /u/). 

Just as RP /i:/ is rarely pure, so RP /u:/ is usually diphthongized, 
[uu] or [uw], especially in final positions (do, shoe, who). Any exag- 
geration of the diphthong, e.g. [ati] or [iti], with total loss of lip- 
rounding on the first element (or, occasionally, on both elements) is 
typical of popular (Cockney) London speech. On the other hand, a 
quality of /u:/ which is too near to a pure C[u], with strong lip- 
rounding, is characterized as affected or over-cultivated. 

(4) Chief sources.—PresE /u:/ derives regularly via ME [o:] from 
OE [o:] (doom, soon, to, tool, goose). Scandinavian [o:] (root, boon) 
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and OE [a:] preceded by [w] (womb, whom, two); also from OF [o:] 
(fool, prove, proof). o and oo are, therefore, the typical spelling forms 
of the regular development to [u:], which was reached by about 1500. 
Certain French words with [u:] (ou spelling), introduced after the 
change [u:] > [au] had begun, i.e. after about 1400, retained their [uz] 
quality (route, routine, group, soup). In many cases, too, especially in 
those words where the French stress-accent on the last syllable has been 
kept, French -on (= probably [0]) has given English /u:/ with a 
spelling change to oo (platoon, balloon, saloon, typhoon). 

PresE /ju:/ derives from many sources. The chief of these are: 
(a) ME [iu], [eu] or [y:] < OE [1] + [w] (Tuesday, hue), < OE [e:a] + 
[w] (you, knew), < OF [iu] (adieu, lieu), < OF [eu] (due, view), < OF 
[y] (duke, accuse, deluge), < OF [ui] (suit, pew, nuisance). It is likely 
that in these cases a ME falling diphthong [iu] changed the relative pro- 
minence of its elements to give [ju: ] in the sixteenth century (a variant 
with [y: ] perhaps remaining in some kinds of English). (b) ME [eu] < 
OE [e:a] or [e:] + [w] (few, dew, hew), and < later French 
(post-1400) [eu] (feud, neuter). This ME diphthong closed to [eu] by 
eModE (still being kept separate from [ju: ]) and finally coalesces with 
[ju: ] in the late seventeenth century. 

(5) Advice to foreign learners.—The quality of this vowel should 
cause no difficulty to most learners, many of whom will have a close 
back rounded vowel in their own language. A pure vowel of this kind 
will usually be suitable in English, though too energetic lip-rounding 
should be avoided. The typical RP centralization or diphthongization 
should be imitated only with caution, since any exaggeration of the 
movement will produce an effect which is judged vulgar or dialectal. 
The centralization of /u: / following /j/ need not be consciously aimed 
at. Those learners, such as Norwegians, who have a centralized [ti] in 
their own languages, should avoid using this sound in English because 
of its dialectal connotations; and those whose own close back vowel is 
unrounded, as in Japanese, should add fairly close lip-rounding. 

More difficult is the relationship of fully Jong [u:], reduced fu] and 
short [u] as in food, boot, and foot. It should be noted, for instance, 
that use (v.) [ju:z] differs from use (n.) [jus] more by the length of the 
vowel than by the quality of the final consonant, and that the difference 
between the vowels of boot ([u]) and of foot ([u]) lies more in their 
quality than in their length. 


7.9.11 eve d 


(1) Examples.—ir, yr—bird, first, girl, myrtle 
er, err, ear—her, serve, err, earth, heard 
ur, urr—turn, church, nurse, purr 
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w + or—word, world, work, worse 
our—journey, courtesy, scourge 


(Note. /3:/ in ‘colonel’ /'k3:n1/.) 


Long [s:]—fur, burn, bird, urge 

Reduced [3]—first, earth, worse, church 

Compare [3:], [3] —cur, curt; heard, hurt; surge, search; purrs, 
purse; Thursday, thirsty; serve, surf 

[H following—earl, curl, world, girls 


(2) Description. —RP /3:/ is articulated with the centre of the 
tongue raised between half-close and half-open, no firm contact being 
made between the tongue and upper molars; the lips are neutrally 
spread. The quality is, therefore, remote from all peripheral Cardinal 
vowel values. 

The quality of /3:/ often coincides with that of unaccented /9/, both 
being central vowels. It is possible to treat /9/ as an unaccented allo- 
phone of /3:/, since it may be claimed that no true opposition between 
the vowels exists. Thus, in the pair foreword /fo:w3:d/, forward 
/'fo:wad/, the second syllable of foreword may be said to carry a 
secondary accent. In any case, there is a difference of quantity between 
/3:/ (not carrying a primary accent) and unaccented /9/, the /3:/ of 
foreword being longer than the /9/ of forward; or again, the relatively 
unaccented, reduced /3:/ of commerce is longer than the /3/ of 
commas—[komas] v. /'komaz/. It is to be noted that /3:/ frequently is 
reduced to /9/ when it is associated with no kind of stress-accent, e.g. 
were /'w3:/, but /wa/; amateur /'zema,t3:/, but /‘zemate/; pennyworth 
/‘peni,w3:8/ = ['peni,w36], but /'peniw20/, where the reduced form of 
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FIG. 19.—/3:/ and variants. 
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/3:/ is still longer than unaccented /9/. /3:/ does not normally occur 
before /1/. 

(3) Variants.—/3:/ being the only RP accented vowel in the central 
area, great latitude of degree of tongue raising is permissible, varying 
from a sound in the half-close region or slightly above to one in the half- 
open region or slightly below. Both variants may be heard in RP, 
especially in the conservative type. The closer variety is also typical of 
certain regional speech, e.g. that of Birmingham and Liverpool and, to 
a lesser extent, in some Australian English. A very open kind of /3:/, 
1.e. one below the half-open central region, entails repercussions in the 
phonetic realization of the opposition /3:/-/a:/, if a qualitative dis- 
tinction is to be maintained between two words such as heard, hard. 
Those using the very open variety of /3:/ usually achieve such a 
distinction by pronouncing the most retracted form of /a:/ (see 
§7.9.6(2)). 

In most cases PresE /3:/ has derived from a vowel + /r/, as the spell- 
ings suggest. A pronunciation with vowel (usually [1, e, a] or [u]) + /r/ 
is retained in many types of English where post-vocalic /r/ 1s still pro- 
nounced, e.g. Scottish English and some kinds of northern English; 
such forms of speech do not possess /3:/ as a distinctive sound. Other 
kinds of English, e.g. south-west British and American, retain a post- 
vocalic (usually retroflexed) /r/ but obscure the preceding vowel to [a], 
e.g. bird [bard], or pronounce an [9] sound with tongue retroflexion 
(r-coloured vowel), e.g. [bad] or [bid]. (The same impression of retro- 
flex quality can, in fact, be given without actual retroflexion of the tip 
of the tongue but with contraction of the body of the tongue.) 

The RP, non-retroflexed, /3:/, being essentially central in relation 
to the remainder of the vowel system, can be regarded as having a 
neutral quality in comparison with the other English vowels. It is the 
indeterminate sound often used for hesitation in English; if it is sub- 
stituted in an utterance for all other vowel phonemes, a high degree of 
intelligibility is retained, provided that the appropriate variations of 
quantity remain. 

(4) Chief sources.—The great majority of cases of PresE /3:/ derive 
from: ME [e] + /r/ (virtue = vertue, earth, heard, fern) or ME [1] + 
/¥/ (shirt, birth, myrrh); or ME [u] + /r/ (word, journey, spur). ME 
[€], [1], and [fu] (> [a] with loss of lip-rounding) all centralized before 
final /r/ or /r/ + consonant, so that the pronunciation [or] forall three 
was incipient in the London region in the sixteenth century and general 
in the late seventeenth century, though contemporary grammarians still 
often insisted upon the earlier vowel quality especially in the case of ME 
[e] and [U] words. With the loss of /r/ in post-vocalic positions in the 
eighteenth century, the PresE central long /3:/ was reached. Some 
attempts to reproduce the French [ce:r] in final accented syllables have 
also resulted in /3:/, e.g. connoisseur /,kona's3:/; liqueur /li'k3:/ (also 
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/l'kjuo/); amateur /jemat3:/ (also /‘zemata, 'eemotjo, 'eematfo, 
'eemoa tjua/); chauffeur /fou'f3:/ (also /'faufa/). 

(5) Advice to foreign learners.—It is comparatively rare to find a 
long central vowel such as /3:/ in other languages. Many languages, 
however, possess somewhat centralized front rounded vowels of the [ø] 
and [ce] types. These are quite unacceptable in English because of the 
lip-rounding. An articulation with spread lips should, therefore, be 
insisted upon, keeping if necessary approximately the same lip position 
for such words as fur, bird, learn, as for fee, bead, lean. Lip-spreading 
is particularly important after /w/, e.g. in word, world, work, etc. In 
addition, the quality must be of a central rather than fronted kind, 
though some latitude may be allowed as far as the degree of raising of 
the tongue is concerned. 

Since nearly all cases of /3:/ occur in words having an r in the spell- 
ing, care must also be taken to avoid post-vocalic /r/ (except as a liaison 
form as in stir up /'st3:r 'ap/) or any retroflexion of the tongue such as 
would produce r-colouring. 


po 12 /9/ 


(1) Examples.—/a/ may be spelt with most vowel letters and their 
combinations, e.g. / (possible), e (gentlemen), a (woman), o (oblige), u 
(suppose), ar (particular), er (mother), or (doctor), ou (famous), our 
(colour), ure (figure), etc. It is most frequently in opposition either with 
zero vowel, e.g. about, bout; waiter, wait, or with unaccented /1/, e.g. 
affect, effect; accept, except; razors, raises; grocers, grosses; mitre, 
mighty; waiter, weighty; sitter, city; battered, batted. 

In addition, it should be noted that /o/ is normal in common 
unaccented (weak) forms of such words as a, an, the, to, for, but, and, 
etc. (see $10.4). 

(2) Description.—/a/ has a very high frequency of occurrence in 
unaccented syllables. Its quality is that of a central vowel with neutral 
lip position, having in non-final positions a tongue-raising between 
half-open and half-close, e.g. in ‘glone, fatigue, decorative, after- 
wards’, etc.; in the vicinity of the velar consonants /k, g/ and /n/, 
however, the tongue may be slightly more raised and retracted, e.g. 
‘long ago” /'lon a gau/. But in final positions, e.g. in ‘mother, doctor, 
over, picture, China’, the vowel may be articulated either in the half- 
open central position or in the most open region of the central area. The 
acoustic formants of /a/ are, therefore, likely to be similar to those for 
/3:/ or /A/ according to the situation. 

(3) Variants.—As is the case for /3:/, /a/ has no qualitative opposi- 
tion within the central area of vowel articulation, so that considerable 
variation is possible within this region. In particular, as has been stated, 
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—— 9 (adjacent to velars) 


—— non-final a 


-> final ə 





FIG. 20.—/a/ and variants. 


the quality of final /a/ tends to be of a more open kind. In certain types 
of RP, both conservative and advanced, this variety reaches an arti- 
culatory area similar to that associated with /a/ (i.e. a slightly raised, 
centralized front open vowel), but may have the same degree of opening 
as /a:/ (i.e. fully open), e.g. the final vowel of mother /'mado/ may be 
more open than the first; or again, the two vowels of father /'fa:d9/ 
may be of similar quality, cases of final /a:/ which might contrast with 
/a/ being rare. The opening of final /9/ to this extent is, however, com- 
monly felt to be an exaggeration characteristic of affected speech. 

In those kinds of English where post-vocalic /r/ 1s still pronounced, 
the cases of RP /9/ represented in the spelling by vowel + rare realized 
as retracted [i] or [A] + /r/,e.g. in Scottish English, or as [ə] + [1] or as 
syllabic [1], e.g. some kinds of American and south-west British 
English. Because of the presence of an articulation corresponding to the 
r spelling in these types of English, words such as finer and China 
(rhymes in RP) are phonetically different in their final syllable. 

(4) Chief sources. —As the great variety of spellings indicates, /9/ 
may represent the reduced (obscured, ‘schwa’) form of any vowel or 
diphthong in an unaccented position. This reduction of unaccented 
vowels, typical of a stress-accent language such as English, has been a 
feature of the English sound system for over a thousand years. Since 
our spelling of vowels is based on the Latin vowel letters, our written 
language has always concealed these obscurations of quality. But it is 
evident from confusions in the use of vowel letters that such reductions 
were taking placc in unaccented syllables even in OE. Thus, OE letters 
#, e, i, in unaccentcd syllable are vcry often confused, probably 
indicating a sound {ë}; o and u, too, are often interchanged, though 
remaining scparatc from a representing [a]. By the eleventh century all 
thesc major distinctions of weak [e], [u], and [a] tend to be confused, 
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especially in unaccented final syllables, no doubt with an obscuration in 
the direction of [ə]. This tendency continued in the ME period, so that it 
has been suggested! that by the middle of the fifteenth century the 
vowels of unaccented syllables showed much the same kind of obscura- 
tion (towards [a] or [i]) as in PresE. Thus, fifteenth-century spellings 
such as disabey, Bishap, tenne a clocke, sapose, seem to indicate a 
vowel of the [a] type. 

Accentual patterns in ME, however, especially in respect of French 
words, were not always the same as now. A word such as adversary 
would, in ME, have a double accent [adver'sa:ri(a)], with a full vowel 
[a:] in the penultimate syllable, which now, being unaccented, has [9] 
or no vowel at all. Moreover, a secondary accent (following the main 
accent) was kept in many words into the seventeenth century (cf. 
modern American English), thus inhibiting obscuration towards [ə], 
e.g. temporary (eModE [tempa,re:r1], Pres American [tempa,rer1)), 
emperor (eModE [‘empa,ru:r]). But, generally speaking, Shakespeare’s 
English can be taken as having [9] where PresE has it, e.g. in such words 
as second, among, palate, father, colour, vulgar, measure (in these last 
cases [9] + [1]), and even in the last syllable of follow (cf. fellow witha 
modern variant /'fela/), where to-day the diphthong is kept in educated 
speech. Sometimes, however, eModE has [1] where in PresE /3/ is more 
general, e.g. pigeon, stomach, reason, squirrel. 

(5) Advice to foreign learners.—The quality of this vowel, including 
the two main allophones described in (2) above, does not usually 
present difficulties to the foreign learner, provided that he remembers 
that English /9/ has no lip-rounding and is extremely short. Moreover, 
when /9/ is spelt with vowel + r, the learner should avoid pronouncing 
any kind of [r] sound, except when in final positions an /r/ is pro- 
nounced as a link to a following word beginning with a vowel, e.g. 
father and mother /‘fa:dar ən 'mado/. 

In particular, the learner should note those syllables of a word con- 
taining /9/, remembering that /9/ is a sound which occurs very fre- 
quently in English and that correct obscuration of the unaccented 
syllables of a word is as much a part of the word’s accentual pattern as 
the stress expended on the accented syllables. In this connection, the 
learner may gain greater familiarity with the occurrence of /a/ by 
reading English texts transcribed phonetically and by himself making a 
phonetic transcription of connected English. 


7.10 Diphthongal Vowel Glides 


The sequences of vocalic elements included under the term ‘diphthong’ 
are those which form a glide within one syllable. They may be said to 


1H. C. Wyld, History of Modern Colloquial English, p. 258. 
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have a 1st element (the starting point) and a 2nd element (the point in 
the direction of which the glide is made). The RP diphthongs have as 
their 1st element sounds in the general region of [1, e, a, 3, u] and for 
their 2nd element [1, u, ə]; these elements may be treated as separate 
phonemic entities (see §7.2, ef seq.). The following generalizations refer 
to all the RP diphthongs: — 

(1) Most of the length and stress associated with the glide is con- 
centrated on the Ist element, the 2nd element being only lightly sounded 
(see §§7.12.1, 7.12.3, for the exceptional cases of /13, Ua/); diphthongs 
of this type are said to be ‘falling’. 

(2) They are equivalent in length to the long (pure) vowels and are 
subject to the same variations of quantity, e.g. plays [ple:1z], place 
[pleis]. The reduced forms show a considerable shortening of the Ist 
element. 

(3) They are particularly susceptible to variation in different 
regional and social types of speech. Even within the RP varieties, con- 
siderable variation is possible in both elements. For this reason, and 
because these diphthongs consist largely of articulatory movement, 
‘typical’ formants are not given, though the glides may be identified 
acoustically in terms of the formant structure of the Ist element (witha 
relatively steady state) and that of the 2nd element. 


(4) No diphthong occurs before /n/, except where word final /n/ is 
assimilated to /n/ in connected speech (see §11.3.4). 

(5) With the exception of /91/, the RP diphthongs often derive from 
earlier pure vowels. 


7.10.1 yee 


(1) Examples.—a—ape, late, make, lady, waste, base 
al, ay—day, may, waist, rail, aim, rain 
ei, ey— eight, veil, weigh, rein, they, whey 
ea—great, steak, break 


(Note: ‘halfpenny’ /‘heipni/, ‘gauge’ /geid3/, ‘gaol’ /dzeil/.) 


Long [e:1]—day, made, game, gaze 

Reduced [e1]|— eight, late, face, safe, ache 

Compare [e:1], [e1]—played, plate; ray, race; way, waist; save, safe 
/e/, /e1/—bet, bate; fell, fail; chess, chase; west, waist 

[t] following—male, pail, failed, sails 


(2) Description.—The glide begins from slightly below the half-close 
front position and moves in the direction of RP /1/, there being a slight 
closing movement of the lower jaw; the lips are spread. The starting 
point is, therefore, C[e] (somewhat closer than RP /e/ of bet). Before 
[t], the [1] element is often absorbed into the [ə] or [u] glide on to [1], e.g. 
sail (set). 
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FIG. 21.—-/e1/ and variants. 


(3) Variants.—In RP the only diphthong in the front region (other 
than the diphthongized version of /i:/-[11]) with which /el/ is in 
contrast is /ai/. The 1st element has, therefore, considerable latitude of 
articulation (especially between the half-close and half-open positions) 
before it risks confusion with the fully open 1st element of /al/. In 
general RP starting points of the type C [e, e, e, e] and a centralized [ë] 
are all found. In some regional speech, however, especially in popular 
London dialect, the Ist element may be as open as [ee] or C [a] or a 
sound similar to that used for RP /a/. In such cases, since confusion 
with RP /ai/ would be likely, the realization of /al/ has a more 
retracted Ist element (C [a] or [b)), so that fate [fait] is kept distinct 
from fight [fait], [fort]. The use of such a ‘wide’ diphthong as [a1] or 
[æ] for RP /e1/ is considered unacceptable for social reasons. Many 
RP speakers react against the popular ‘wide’ realization by using the 
closest and ‘narrowest’ variety of /e1/. In advanced RP, however, there 
may belittle or no vocalic glide in the realization of this phoneme, espe- 
cially in the fully long allophone, e.g day, game, made, with [e:] or [e:]; 
this monophthongized form may also be heard in cases where, for 
rhythmic reasons, the quantity is somewhat reduced, e.g. lady, nature, 
relation, with [e]. 

A long monophthong of a type varying between C [e:] and [e:] is also 
to be heard in many British regional dialects as a form corresponding to 
the diphthongal RP /e1/. 

(4) Chief sources. —The main sources of PresE /e1/ are ME [a:] and 
[ee1] or [e1]. Of these, ME [a:] develops from: OE [ee] or [a] lengthened 
in an open syllable (name, ape, raven, ale) or from Scandinavian words 
with a similar vowel (gate, take); OF [a:] (male, nature, cave, state); 
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earlier OF [au] (chamber, change, strange, safe). This ME [a:] was 
closed in the eModE period to [e:] and, perhaps, in some pronuncia- 
tions, to [e:]. (Note the pronunciation of such words as great, steak, 
break—with ME [e:]—which have /e1/ in PresE instead of the more 
usual /1:/ deriving from this ME vowel.) 

ME [ei] or [ei] develops from: OE [e] + [j] (day, again); OE [e :] + 
[j] (grey, clay); OF [ai] (pay, chain); OE [e] + D] (way, play); 
Scandinavian [ei] (they, swain), OF [ei] (faith, obey). It will be seen that 
the ME diphthong of the type [ei] results from a coalescence of two 
earlier distinct diphthongs of the types [æi], [ai] and [ei], [ei], from both 
English and French sources. In many cases the modern spelling reflects 
the origins, e.g. vain as against vein, but often the spelling has been 
changed, e.g. way < [ei], grey < [ei]. The coalescence of the two 
earlier diphthongs took place at the end of the ME period, producing a 
glide of the [ei] or [æi] type. The evidence suggests that this glide was 
monophthongized in the eModE period, though the diphthongal pro- 
nunciation continued to be recommended by the earlier grammarians. 
The new monophthong was of the [€ :] type, thus coalescing with [e:] < 
ME [a:]. 

In the seventeenth century, therefore, a pure vowel [e :] was probably 
used in both classes of words, though careful speakers might still use a 
diphthong in the second group. The general diphthongization of this 
[e:] is likely to have begun in the eighteenth century, developing 
towards a glide of the present type. 

In addition, a number of words of more recent importation to-day 
have /e1/ in imitation of the French [e] or [£], e.g. fiancé, soirée, ballet, 
bouquet, beige, crépe. 

(5) Advice to foreign learners.—Foreign learners should give suffi- 
cient length to the first element of this diphthong, making the correct 
reduction of quantity in the appropriate contexts. Care should also be 
taken that the quality remains within the permitted RP limits, i.e. 
preferably slightly more open than C[e] and not as open as C[e]. The 
second part of the diphthong should be only lightly touched on and 
should never reach the region of fully close [1]. Those who do not havea 
diphthong in this area in their own language should avoid substituting 
for it either [e:] or [e:]; the lower jaw should make an appreciable 
closing movement. 


7.10.2 VENA 


(1) Examples.—I, y—time, write, bite, climb, cry, dry, by 
igh, eigh—high, light, fight, might, height 
le, ye—die, lie, pie, tried, dye 
er, ai—either, elder, aisle 


(Note: /ai/ in ‘eye, buy’.) 
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Long [a:1]—fly, die, mine, hide, eyes 

Reduced [ai] —fight, like, ice, ripe 

Compare [a:1], [ai]—tie, tight; tidal, title; eyes, ice; riding, writing 
[t] following—mile, aisle, piles, mild 


(2) Description.—The glide of RP /ai/ begins at a point slightly 
behind the front open position, i.e. C[á], and moves in the direction of 
the position associated with RP /1/, although the tongue is not usually 
raised to a level closer than C [ë]; the glide is much more extensive than 
that of /e1/, the closing movement of the lower jaw being obvious. The 
starting point may be similar to the articulation used in RP /A/ (see 
§7.9.5). The lips change from a neutral to a loosely spread position. 
Before [t] the [1] element 1s often absorbed into the [a] or [u] glide on to 
the [t], e.g. pile [pa: 1]. 

(3) Variants.—Variants commonly used in the realization of this 
diphthong consist mainly in differences of starting point of the glide. 
Since RP /e1/ is realized between the limits [e1] and [e1], /a1/ cannot, 
while remaining contrastive, have a first element closer than C[e]. 
Those RP speakers who use the closest form of /e1/ will probably have 
a type of [ei] glide for /a1/, while those whose /e1/ is nearer to [e1] may 
realize /al/ with a more retracted type of [a] or fronted [a]. In those 
types of pronunciation, e.g. popular London speech, where /el/ is 
realized as [ee1] or [ai], /ai/ must have a very much more retracted Ist 
element, i.e. [a] or [bo]. Again, as for /e1/ in advanced RP speech, there 
is a variety of /ai/ with an extra-long Ist element and very little glide, 
usually long fronted [a:] (or the equivalent of long /A/) with perhaps a 
slight movement to [ë]; such a realization is most commonly heard in 
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Figs 22 ar and variants. 
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those situations where the phoneme is fully long, e.g. die, try, fine, or 
where there is, for rhythmic reasons, a slight reduction of quantity, e.g. 
Friday, libel, climber. 

In some regional speech a considerably centralized Ist element may 
be heard, i.e. [3:1]; in Scottish English, /a1/ may be distinguished from 
a glide starting from a retracted, short [A] as lst element, used especially 
in uninflected word forms, e.g. side with [Al] as against sighed with /al/ 
= Ae |||, 

(4) Chief sources.—PresE /ai/ derives regularly from ME [E], 
whose main sources are: OE [i:] (ice, like, time, life); OE lengthened [1] 
(child, find, wild); OE or Scandinavian [y:] (hide, mice, kind, sky); OE 
[e:] or [e:] + [3] (fly, lie, dye, eye); OE [e:] + [ç] (ight, night); OF [1:] 
(fine, arrive, licence, price). The pure vowel [i:] probably diph- 
thongized by the beginning of the sixteenth century, the Ist element 
following a central route with increasing opening: [11] inthe late ME (cf. 
PresE diphthongization of /i:/), [a1] in the sixteenth and early seven- 
teenth centuries, [Ai] (where [A] has a half-open back to central value) in 
the late seventeenth and eighteenth centuries, and a variety of [a1] from 
the late eighteenth century. In the speech of some speakers even to-day, 
especially among those using a conservative type of RP, a more 
centralized and retracted Ist element of /a1/ may still be heard. 

In the case of the words either and neither, /ai/ exists to-day beside a 
variant with /i:/ deriving from an earlier [e:]. 

(5) Advice to foreign learners.—Apart from observing the proper 
reductions of quantity in syllables closed by a fortis consonant, foreign 
learners should avoid over-retraction of the quality of the Ist element, 
so as to remain within the limits of the RP vowel; a front open starting 
point 1s to be recommended. Care should also be taken not to glide to 
too close a position, 1.e. to the C[i] area, such as is: reached in 
diphthongs of this type in many languages. 


7.10.3 A 
(1) Examples,—oi, oy —boy, toy, noise, voice, boil, point 
(Note: ‘buoy’ /b91/.) 


Long [>:1]—boy, noise, void, coin 

Reduced [91] —voice, joist, joint, choice 
Compare [9:1], [91] —noise, voice; joys, joist 
[H] following —soil, coiled, boils 


(2) Description.—For RP /1/ the tongue glide begins at a point 
between the back half-open and open positions and moves in the direc- 
tion of /1/, generally not reaching a level closer than [ë]. The tongue 
movement extends from back to centralized front, but the range of 
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FIG. 23.—/31/ and variants. 


closing in the glide is not as great as for /ai/; the jaw movement, though 
considerable, may not, therefore, be as marked as in the case of /al/. 
The lips are open rounded for the Ist element, changing to neutral for 
the 2nd. Before [t] the [1] element is often absorbed into the [a] or [u] 
glide on to the [t], e.g. oil [9 :°t]. 

It will be noted that this is the third diphthongal glide towards an [1] 
sound; it is, however, the only glide of this type with a back starting 
point (if the case of [ui], as in ruin, is discounted). To this extent, /91/ 
may be considered asymmetrical in the RP diphthongal system. 

(3) Variants.—The variants of this diphthong are less striking than 
those affecting the diphthongs so far treated. The starting point may 
vary between C[a] (only rarely for some conservative RP speakers) and 
C[o] (popular London). The latter variant is to be related to the 
London realization of /ar/ as [ai]; the 1st element of the diphthong in a 
word such as boy must be closer than in RP in order to maintain the 
contrast with the glide in buy [bai]. In some conservative RP (now 
rare), especially in the traditional pronunciation of the clergy, a con- 
siderably centralized and unrounded Ist element [91] is used. In all 
varieties of /o1/ the quality of the final element rarely reaches the 
position associated with /1/, a sound of the [é] type being more usual, 
while [€] is not uncommon among RP speakers. It will be seen that, 
when the latter final element is preceded by [9], the gliding movement of 
the tongue is very limited. 

(4) Chief sources. —PresE /91/ derives mainly from OF sources with 
[91] (choices, noise), [o1] or [ul] (boil, point). In ME some words, now 
with /o1/, were pronounced with [ui]; a regular development of this 
latter diphthong’s Ist element would result in eModE [vi] or [al]. 
Confusion was, therefore, possible between words containing this glide 
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and those containing [a1] < ME [i:], now with /a1/. We find, in fact, 
that some PresE /21/ words have [v1] in e ModE (boil, coin, point, join) 
and that, in some cases, there is confusion with PresE /al/ words, e.g. 
boil rhyming with bile. Such pronunciations were current in educated 
English until late in the eighteenth century and since then have occurred 
in popular regional forms of speech. 

(5) Advice to foreign learners.—This diphthong does not present 
very great difficulties to foreign learners, provided that, in addition to 
the appropriate variations of quantity, the quality of the Ist element lies 
between the sounds of RP /o:/ and /b/ and that the glide does not 
extend beyond the half-close front level, 1.e. [ë]. 


7.10.4 /30/ 


(1) Examples.—o—so, old, home, both, folk 
oa—oak, road, foal, toast, soap 
oe—toe, doe, sloe, foe, hoe 
ou, ow—soul, though, shoulder, know, blow 


(Note: /30/ in ‘mauve, brooch, beau, sew, shew’ —variant spelling of 
‘show’—‘don’t, won’t’) 


Long [9:u]—go, toe, home, road, pose 
Reduced [av]—goat, rope, oak, post, both 
Compare [a:u], [eu] —robe, rope; toes, toast; grows, gross; road, 
wrote; cold, colt 
/30/, /3:/—foe, fur; own, earn; goal, girl; oath, earth; 
coat, curt; foam, firm 
/20/, /9:/—SO, Saw; pose, pause; bold, bald; load, lord; 
boat, bought; choke, chalk 
/30/, /3:/, /9:/—foe, fur, four; bone, burn, born; woke, 
work, walk; coat, curt, caught; coal, 
curl, call 
[t] following—hole, roll, old, moult, bolt, poles 


(2) Description.—The glide of RP /au/ begins at a central position, 
between half-close and half-open, and moves in the direction of RP 
/U/, there being a slight closing movement of the lower jaw; the lips are 
neutral for the Ist element, but have a tendency to round on the 2nd 
element. The starting point may have a tongue position similar to that 
described for /3:/. 

(3) Variants. —A number of variants of this narrow diphthong are to 
be found within RP. The type described above is that which has in 
recent years become general. A more conservative diphthong, however, 
has its starting point in a more retracted region, [9] or [$] and the whole 
glide is accompanied by increasing lip-rounding. Another variety (of an 
advanced kind and usually characterized as an affectation) has a start- 
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ing point more forward than the central area, i.e. [e] or [ë]. It is also to 
be noted that, in the speech of many speakers of general RP, the Ist 
(central) element is so long that there may arise for a listener a 
confusion between /2U/ and /3:/, especially when [t] follows, e.g. goal, 
girl (the [u] element of /2U/ being confused with the glide on to [f]). 
Though this may be a source of possible confusion for a listener, the 
speaker will often retain a qualitative distinction between /2U/ and 
/3:/, the latter being realized with a very open type of central vowel. 

In popular London speech the diphthong has a more extensive glide, 
the starting point being equivalent to that of a fronted [a] or [Æ] (the 
latter having a degree of pharyngeal constriction which distinguishes it 
from the starting point of the affected version [ëu] mentioned above). 
The glide finishes in a fronted back position between half-close and 
close, usually without lip-rounding, i.e. [wi] or [Y]. In this type of 
speech, widening of the diphthongal glide is necessary, since a narrow 
diphthong of the [au] type is used as a realization of RP /u:/, [soup] 
being dialectal soup and RP soap. In modified London region speech, 
though the starting point may be closer and more central than that of 
the popular form, lip-rounding is again often absent, so that the 2nd 
element resembles [ti]; or lip-rounding may accompany a more fronted 
vowel, i.e. [ati] or [ay]. In London, too, a more open Ist element is often 
heard before [t], e.g. in dole, roll, cold (see 88.7). 

In other parts of Britain there occur for RP /au/ diphthongs of the 
types [0u] and [ou], where the glide begins with a back vowel, or of the 
type [ëy] where the 2nd element is a fronted, rounded close vowel; or 
again, many regional types of speech have a pure long vowel varying 
between C [o:] and C [9°]. 

The resemblance between /2U/ and a central vowel is further revealed 
by the behaviour of /au/ in unaccented syllables, e.g. /fa netiks, 
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diso'be1/ for the more careful forms with /au/ or [ou] or [o]; also the 
familiar pronunciation of fellow as /'fela/ and the vulgarism /‘winda/ 
for window. The reduction of /əu/ to /3/ may produce in colloquial 
speech homophones which are distinct in a more formal style, e.g. 
ferment, foment—both /fa'ment/, hypertension, hypotension—both 
/haipa tenfn/. 

(4) Chief sources.—PresE /au/ derives chiefly from ME [9:] or ME 
[ou]. Of these, ME [9:] has the following main sources: OE [a:] (no, go, 
home, loaf, ghost); OE [ea] or [ee] + [ld] (old, told, cold); OE [o] in 
open syllables (over, open, nose, hope); OF accented [9] (robe, rose, 
coat, toast, gross). This pure vowel had probably reached a quality near 
C[o:] in the sixteenth and seventeenth centuries. 

ME [ou] derives mainly from: OE [az] or fo] + [w] or [y] (know, 
blow, soul, snow, own, dough); OE [o:] + [w] (glow, flow, row). This 
diphthong probably reached the stage [ou] by the seventeenth century, 
with an alternative pronunciation [o:]. The coalescence with [o:] < 
ME [o:] took place in the seventeenth century and the new [o:] was 
diphthongized to [ou] in the eighteenth and nineteenth centuries, to give 
Fres E 7 ou 

/əu/ is also used in PresE in imitation of French [o] or [9] in more 
recent importations such as beau, vaudeville, hotel, bureau. 

(5) Advice to foreign learners.—In some current books dealing with 
the pronunciation of British English, this diphthong is transcribed as 
‘ou’. Since the Ist element is now clearly of a central type, such a 
transcription may be misleading. Foreign learners should avoid starting 
the glide with a truly back vowel, but any kind of front rounded vowel, 
e.g. [o] or [ce] is also wrong. It is advisable to learn /3:/ first and to 
modify /3:/ by adding lip-rounding to the end of the vowel. Thus, fur 
may be modified to foe, girl to goal, burn to bone, etc. In this way, the 
diphthong will be kept distinct from /3:/ (see comparative examples in 
(1) above). In addition, proper prominence must be given to the Ist 
element and reduction of the total length of the glide made in the 
appropriate contexts. 


7.10.5 /au/ 
(1) Examples.—ou, ow—house, sound, out, cow, town, allow 
(Note: ‘Macleod’ /ma klaud/ .) 


Long [a:u]—how, loud, town, cows 

Reduced [au] —shout, about, mouse, mouth 

Compare [a:U], [au] —allows, a louse; found, fount; mouth (v.), 
mouth (n.); loud, lout 

[t] following—cowl, foul, owls 
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(2) Description.—The glide of RP /au/ begins at a point between the 
back and front open positions, slightly more fronted than the position 
for RP /a:/, i.e. C[G], and moves in the direction of RP /u/, though 
the tongue may not be raised higher than the half-close level, i.e. [6]. 
The glide is much more extensive than that used for /əu/ and is 
symmetrically opposed to the front glide of /a1/. The lips change froma 
neutrally open to a weakly rounded position. 

(3) Variants.—The RP diphthong /au/ is in opposition in the back 
region with /au/; if the latter has a starting point in the central area 
below half-close, the starting point of /au/ cannot be raised to any 
extent without the possible loss of contrast between such words as fone 
and town. RP variants, therefore, involve particularly the fronting or 
retraction of the starting point rather than its raising. Considerable 
latitude is permitted between the values C[á] and C[a]; for many 
speakers, the 1st element of /ai/ and /au/ may in fact be identical. 
Since, however, several popular regional forms of speech (especially a 
modified popular variety of the London region) have typically a Ist 
element in the C[a] or [ee] areas, reaction amongst careful speakers 
causes the diphthong to have a more retracted starting point, sometimes 
reaching C[a]. In some kinds of RP, usually of an advanced type, the 
[a] element is extra long, especially in those contexts in which the diph- 
thong has its fully long form, with a weak glide involving comparatively 
little raising of the tongue and little, if any, lip rounding; /oud and /ard 
may, therefore, be distinguished only by a slight movement at the end 
of the vowel. In London speech, both in the popular and modified 
forms, the 1st element may be of the [e] or [se] varieties. It would appear 





HIG. 25.—/au/ and variants. 
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that with such a glide there would be risk of confusion with the popular 
London realization of /əu/ as [zu] or [ay], as between such words as 
now and no; in fact, the starting point used for /aU/ may be more open 
than that of /au/, e.g. no [na¥], now [néu]. Even if the two starting 
points are on the same level, however, the diphthongs are kept 
phonetically separate by the greater centralization of the 1st element of 
/əu/ and also, in some cases, by the closer end point reached in /au/ as 
compared with that of /au/: [Æ¥], [ay], [äu] or [Æw] for /əu/ as against 
[ee3] or [e3] for /au/. In addition, in popular London speech, /au/ may 
be realized as a long, relatively pure vowel of the [a] type, e.g. town 
[ta:n]. 

In those kinds of English, e.g. Scottish and some Northern, which 
use [o:] or [ou] for RP /əu/, a more central type of starting point is 
often used for /au/, i.e. [pu] or [AU]. In the regions surrounding 
London a closer, front starting point may be heard, i.e. [eu], often with 
unrounding of the 2nd element, [ev]; in the West Country the 2nd 
element may be rounded but fronted, [éy] or [ay]. 

(4) Chief sources.—RP /au/ has origins and development in the 
back vowel area similar to those of RP /a1/ in the front. PresE /au/ 
derives regularly from ME [u:], whose main sources are: OE [u:] (cow, 
house, mouth); lengthened OE [u] (ground, found); OE [u] or [o:] + 
[w] or [y] (fowl, bow, bough); OF or Anglo-Norman [u:] (allow, 
powder, couch, count, mountain). Diphthongization of the pure vowel 
begins in the fifteenth century ({Uu]) and the Ist element is progressively 
lowered by a central route ({au] or [AU]) during the sixteenth and seven- 
teenth centuries, there being no confusion with the development ME 
[5:] or [ou] > [o:] or [ou] > PresE /au/. The more open Ist element of 
/au/ must have become established during the eighteenth and nine- 
teenth centuries. 

In types of Northern and Scottish English the diphthongization did 
not take place, the original [u: ] being retained, e.g. house [hu(:)s]. The 
pure vowel [u:] is also sometimes retained in RP in the vicinity of 
bilabial consonants, e.g. in wound (‘to injure’), droop, stoop; in other 
case, ME [u:] was shortened and gives PresE /a/ (plum, crumb, 
thumb). 

The present spelling forms ou, ow, for /au/ result from the French 
influence on the language. OE /u:/ was spelt with 1, but this letter had 
a different value in French, [y]; by ME, therefore, the French spelling 
ou or ow for the sound [u:] was generally used. 

(5) Advice to foreign learners.—Just as for /al/, foreign learners 
should be careful to use a correct 1st element, 1.e. a variety which is not 
so fronted or raised as to be dialectal; a starting point too near to C[a] is 
also to be avoided. The Ist element should be the most prominent and 
the 2nd element only lightly touched on, the tongue closing toa position 
not higher than half-close, i.e. [6]. 
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7.11 Diphthongs + [a] 


All the preceding diphthongal glides /el, al, 91, 90, au/ are falling (i.e. 
with length and stress on the Ist element) and closing (1.e. gliding from a 
more open to a closer position); three of them, /al, 91, aU/, require an 
extensive movement of the tongue. All may be followed by [3] within 
the word, either as an inseparable part of the word, e.g. Noah, fire, 
choir, iron, hire, society, our, sour, tower /nəuə, fala, kwara, alan, 
halo, so'salatl, ava, saua, taua/ or as a suffix (morpheme) appended to 
the root, e.g. greyer, player, slower, mower, higher, drier, employer 
/grela, plera, slaua, məuvə, hais, draia, m'plo1o/ or, sometimes, as a 
separable element internal in a composite form, e.g. nowadays 
/‘navadeiz/. In such cases, a third vocalic element [a] may, in slow 
speech, be added to the two elements of the diphthongal glide; but there 
is a tendency in rapid and advanced RP to omit the 2nd ([1] or [v]) 
element, especially when [ə] is not felt as a separable morpheme. This 
process is sometimes known as smoothing. 

(1) [ato] > [a:o] in general RP, e.g. in fire, tyre, choir, society, hire, 
shire, byre, lyre, liable, and also in cases where [9] may be considered as 
a separable suffix, e.g. higher, shyer, buyer, liar. 

(2) [aua] > [a:3] in general RP, e.g. in our, shower, flower, coward, 
nowadays. 

It will be seen that the reduction of the phonetic sequences [a13, aua] 
to [az9, a:3] results in a phonemic opposition relying on a distinction 
between [a:] and [a:]. It is natural that such a tenuous qualitative 
difference should be levelled out, with the result that both original [a13] 
and [aus] are frequently reduced to a diphthongal glide whose Ist 
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FIG. 26.—/aia, aus/ and variants. 
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element is a central open vowel. Several new homophones are produced 
in this way, e.g. tyre, tower; shire, shower; sire, sour. 

In addition, in advanced RP the diphthongal pronunciations thus 
produced are often further reduced to a long monophthong, 1.e. [a:a] 
> [a:] and [a: 3] > [az]. If [a:] and [a:] are kept distinct, there is never- 
theless confusion between [a:] < [aua] and /a:/, resulting in such 
homophones as shower, Shah; tower, tar. A more extensive levelling 
(criticized as an affectation and also as a Cockney vulgarism, but widely 
heard amongst educated RP speakers) reduces both [a:3] and [a:3] to 
[az], so that homophones of the type shire, shower, Shah; tyre, tower, 
tar; byre (or buyer), bower, bar, are produced, all with /a:/. This 
monophthongization of /aia/ and /aua/ and their coalescence with 
/a:/ is likely to be one of the most striking sound changes affecting 
Southern British English in the twentieth century. 

(3) [e13] > [e:3] in general RP, e.g., in player, greyer, conveyor, 
layer. In these examples, in which it is a question of /e1/ + an /9/ 
suffix, the resultant diphthong is frequently levelled with the /ea/ of 
there, rare, etc. Thus, such homophones as prayer, pray-er; lair, layer, 
are produced (see §7.12.2 for reduction /e9/ to [e:]). 

(4) [ava] > [a:] = /3:/ in general RP, levelling frequently occurring 
between mower, Slower, and myrrh, slur (with /3:/). 

(5) [019] > [9:9] in general RP, as in employer, enjoyable, buoyant, 
joyous. In these cases the [o:] element of the diphthong is qualitatively 
distinct from the value associated with /a:/, since it has a tongue 
position not higher than half-open. Thus, drawer (‘one who draws’) 
with /o:/ + /a/ may have a closer initial vowel element than the 
Starting point of the glide in the reduced form of coir, with [9:9]. 

Some speakers distinguish between sequences of diphthong + /al/, 
usually in the case of terminations spelt -e/, -a/, e.g. trial, towel, royal, 
and sequences of diphthong + /l/, e.g. tile, owl, toil. However, the 
first sequence, containing three vocalic elements, may be reduced to a 
centring diphthong; and, in the case of the second type of sequence, an 
[a] (or [uJ) glide is present before [t], so that a similar triphthongal glide 
reducible to a centring diphthong is produced. Thus, /el, al, au, 91/ 
followed by either /I/ or /al/ tend to be realized as [e:9, a:9, a13 or aru, 
9:9], the examples given above being perfect rhymes. In the case of /9Uu/ 
+ [t], the [vu] element of the diphthong may be retained, both because it 
is reinforced by the glide on to [t] and also in order to maintain the 
distinction /296/-/3:/ as in pole, pearl. 

The reduction of diphthong + [ə] toa centring diphthong takes place 
not only within words but also between a word final diphthong fol- 
lowed by word initial /9/, e.g. they are [0e:3] or [0e:3], rhyming with 
there; go away [ga:9 wel] = /'g3:9'we1/; buy a house ['ba: a haus]; now 
and then [na:an' den]; boy and girl ['bo:0n'g3:1). 

The weakness of the final elements of diphthongs is also demon- 
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strated by their instability before vowels other than [9]. Thus, in the 
case of /ei, al, 91/, when /1:/ or /1/ follow (i.e. a vowel articulation at or 
closer than the end point of the diphthongal glide, the [1] 2nd element of 
the diphthong may be lost, e.g. in playing, way in, they eat it, highest, 
hyaena, buy it, try each, annoying, the boy easily . . .; the [1] element 
may also be absorbed before other vowels, e.g. in way up, by all means, 
they understand, toy engine, though some glide in the direction of [i] is 
likely to be made when the following vowel has a quality near to that of 
the Ist element of the diphthong (especially of the wide diphthongs /al, 
91/), e.g. in may end, my uncle, the boy often. . . . In the case of /au, 
au/, absorption of the [u] element before /U/ or /u:/ rarely arises, since 
a following /U/ or /u:/ is unusual. Again, absorption of the [u] element 
of the narrow diphthong /2u/ frequently occurs before other vowels, 
e.g. in go easy, glowing, no end, go off, know all, show up, though 
when /3:/ follows some movement towards [v] and lip-rounding 
normally takes place, e.g. in so early. Loss of the [u] element of /ou/ 
does not, in pre-vocalic positions, lead to confusion with /3:/, since 
/3:/ in such a position will normally be realized with a linking /r/, cf. 
slow it / slu 1t/ or /sla : 1t/ and slur it /sla:r it/. In the case of /au/, 
loss of the [u] element may also occur before vowels other than [o], e.g. 
in allow each, vowing, how else, now or never, but when the following 
vowel has an open quality similar to that of the 1st element of /au/ 
some tongue movement towards [u] and lip-rounding normally takes 
place, e.g. in how are they, plough up, how odd. This tendency to 
absorb the 2nd element of diphthongs before other vowels is a feature 
which is more marked in advanced RP than in either the general or the 
conservative forms of this kind of English. 

A similar weakening of the monophthongs /u:/ and /1:/ sometimes 
occurs across syllable boundaries. /u:/ may be replaced by /u/ and /1:/ 
by /1/ before vowels in such phrases as two in the morning /'tu m da 
'mo:nin/ and three o’clock /'9r1 9'klok/. It is possible in a word such as 
ruin to regard the pronunciation /‘rum/ as a version of /‘ruzin/ 
exhibiting smoothing. 

Foreign learners should be aware of this tendency to reduction of 
vowel sequences, in order that they may understand colloquial English. 
They will observe that such reduced forms are normal among many 
educated speakers. Nevertheless, like most changes of pronunciation, 
these reductions are often condemned as vulgarisms—frequently by 
those who use them and are not aware of the fact. Foreign learners 
should, therefore, avoid the extreme forms of reduction, e.g. [a:] and 
[a:] for [ato] and [ava], and [3:] for [ava]. But the levelling to [a:a], 
[azo], [e:3], and [9:9], described above, may be taken to be current and 
permissible. Certainly such pronunciations are preferable to sequences 
containing an exaggerated [1] or [u] element, i.e. [3] or [w], giving [aja], 
[awa], [awa], [eja], [oJa], etc. 
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7.12 Centring Diphthongs /1, e9, U3/ 


A /13/ 


(1) Examples.—eer, ear, ere—deer, dear, tear (drop of liquid), here 
eir, ier, ir—weird, fierce, fakir 
ea, ia, eu, eo—idea, Ian, museum, theological 


(Note: ‘hero’ /hiarav/, ‘year’ /j19/ or /j3:/.) 


Long [1:3] —dear, here, cheer, beard 
Reduced [19] —pierce, fierce 
Compare [1:9], [19] —fears, fierce 

[t] following'—real 





FiG. 27.—Final /19, €9, va/ and variants. 


(2) Description.—The glide of RP /19/ begins with a tongue position 
approximately that used for /1/, 1.e. centralized front half-close, and 
moves in the direction of the more open variety of /9/ when /19/ is final 
in the word; in non-final positions, e.g. in beard, fierce, the glide may 
not be so extensive, the quality of the [a] element being of a mid type. 
The lips are neutral throughout, with a slight movement from spread to 
open. 

It has been pointed out by Daniel Jones’ that the sequence [i] + [ə] 
may not always constitute the falling diphthong described, 1.e with pro- 


¡This solitary example of /19/ + {t] may be taken as a variant of /i:/ + {t] (cf. also 
ideal). It is possible to pronounce real with /i: / in the same way as eal, feel, etc.; /19/, 
however, seems to be more common in this particular word, cf. reel /ri:1/, rill /ril/. 

2See Daniel Jones, ‘Falling and Rising Diphthongs,’ Miscellanea Phonetica, 11 (Inter- 
national Phonetic Association, 1954). 
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minence on the Ist element. In unaccented syllables the [1] element may 
be the weaker of the two, being equivalent to [j], cf. the two [1] + [9] 
sequences in both syllables of period, serious [‘piariad] or ['prorjad] and 
['siorias] or ['siorjas]. The glide of the rising type [ía] is often used when 
the [9] represents a termination with morpheme status, e.g. easier, 
carrier, in which case [1] and [9] are conveniently treated as a hiatus of 
vowels in two syllables! with a variant monosyllabic pronunciation [ja]. 
Such a solution may also be applied to those cases where it is not a 
question of a morpheme boundary, e.g. hideous, genius, idiom, 
billiard, morphia, etc. In some cases the choice between a falling or a 
rising diphthong may depend upon the accentual pattern placed upon 
the word, e.g. theological [013 lod31k1] (falling diphthong on the first 
syllable which carries secondary accent) or [8%9'lnd3ikl] (rising 
diphthong when the first syllable is unaccented). It may, therefore, be 
said that the falling diphthong in the last syllable of reindeer depends 
upon a degree of accent on that syllable, cf. the rising diphthong in the 
unaccented last syllable of windier.? 

(3) Variants.—In some kinds of advanced and conservative RP, and 
especially when /19/ is final, the prominence and length shift to the 2nd 
element of the accented diphthong, this final quality often being the 
most open type of /3:/ or /aA/ or even /a:7. Thus, here, dear, may be 
realized as /hj3:, dj3:/ or /hja: = [ca:], dja: /. The form with /a:/ is 
usually characterized as an affectation. In popular London speech a 
glide from a relatively close to an almost open position may be heard— 
[i] —-[á], sometimes with an intervening [j]. In those kinds of English 
where post-vocalic /r/ is pronounced, RP /19/ is realized as /i:/ + /r/ 
in those words which have an rin the spelling of the RP diphthong, e.g. 
here /hi:r/. 

(4) Chief sources.—PresE /19/ results generally from ME [e:] + [r] 
(here, hear, dear, weary, appear, clear) and from early ME [e:] + [r] 
(ear, shear, fear, beard). In the sixteenth and seventeenth centuries 
some variation existed between [1:41] and [e:1], but by the seventeenth 
century [1:1] was predominant; [1] was lost and replaced by [a] in the 
eighteenth century. In those cases where no r occurs in the spelling, e.g. 
idea, lan, museum, the [a] element, derives as usual in English from the 
obscuration of the weak vowel. 

(5) Advice to foreign learners.—Foreign learners should avoid using 
a lst element which is too close, i.e. /1/ should be used rather than 
/i:/, Although the r, which occurs frequently in the spelling of this 


! Increasingly a closer vowel is to be heard in such words, as (i: zio], [kaeria] which may 
be interpreted as /i:zi:9/ and /keeri: 9/, an interpretation which supports the view that 
there is vowel hiatus. 

2See B. S. Andrésen, ‘-dier and -deer; an experiment,’ Le Maitre Phonétique, No. 108, 
1957. 
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diphthong,.should not be pronounced finally or before a consonant, it 
should be remembered than an [r] link is regularly made before a 
following vowel, either initial in the next word of the group, e.g. here 
and there /‘hiar an 'de3/, or occurring in the following syllable of the 
same word, cf. hear /hio/ and hearing /‘hiarin/. 


TZ PEST 
(See Fig. 27) 


(1) Examples.—are—care, rare, share, mare 
air—air, fair, pair, chair 
ear—bear, pear, wear, tear (v.) 


(Note: with /ea/, ‘heir, there, their, Mary, Sarah, scarce, aorist’.) 


Long [e:3]—pair, there, chairs, cared 
Reduced [e9]—scarce 
Compare [e:3], [ea] —scares, scarce 


(No cases of /ea/ + [t].) 


(2) Description.—The glide of RP /ea/ begins in the half-open front 
position, i.e. approximately C[e],' and moves in the direction of the 
more open variety of /3/, especially when the diphthong is final; where 
/ea/ occurs in a syllable closed by a consonant, the [ə] element tends to 
be of a mid [a] type. The lips are neutrally open throughout. (See 
§7.11(3) for the reduction of /e1/ + /a/ to /e9/.) 

(3) Variants.—RP /e3/ has variants mainly in respect of the degree 
of openness of its lst element. The presence of the opposition /13/-/€3/, 
e.g. fear-fair, would appear to inhibit too great a raising of the starting 
point of /ea/. Nevertheless, in popular London speech, where the 
starting point of /13/ may be closer (see §7.12(3)), /ea/ tends tostartata 
point slightly lower than Ce], i.e. slightly below half-close. A feature 
of conservative and advanced RP is an even greater opening of the Ist 
element of /e9/, i.e. [çə] or [aa], the glide being very slight. Another 
form of advanced RP uses a long pure vowel {e:], often somewhat 
centralized, especially in a non-final syllable, e.g. careful, scarcely 
(‘ke: ft], ['ske: sl1]. A centralized pure vowel is also a feature of certain 
Midland and Northern speech, notably that of Birmingham and Liver- 
pool. In those types of regional English where post-vocalic r is 
pronounced, RP /ea/ is replaced by /e:1/ or /e:r/, 6.2. fair ie. 

(4) Chief sources. —PresE /ea/ derives from three ME sources: ME 
[a:] + [r] (care, hare, mare); ME [e1] or [zi] + [r] (their, air, hair, fair); 
ME [e:] + {r] (bear, there, where, swear). We have seen (§7.10.1(4)) 


lIt is to underline the half-open beginning of /ea/ that the diphthong is transcribed 
here with [e] rather than [e]. A simplified transcription using [e] would, however, lead to 
no ambiguity in the representation of the RP vowel system. 
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that ME [a:] and [æi] tended to coalesce into [e:] in e ModE and they in 
turn are confused by the seventeenth century with the descendent of ME 
[e:] when followed by [r]. It is likely that all words with PresE /29/ were 
pronounced with [e: 1] (or perhaps [eis]) in the seventeenth century. The 
diphthongal glide to [a] results from the loss of post-vocalic [1] in the 
eighteenth century. 

(5) Advice to foreign learners.—The post-vocalic r of the spelling 
forms should not be pronounced, except as a linking form when a 
following word begins with a vowel, e.g. pair of shoes /'pear av 'fu:z/, 
or when a vowel occurs in the following syllable of the same word, e.g. 
care /kea/, but caring /'keorin/. Attention should also be paid to the 
half-open articulation of the Ist element of /€9/; in order to emphasize 
this point, it is often helpful to begin the glide from the quality of RP 
/e/, cf. cat and care. 


7.12.3 /U9/ 
(See Fig. 27) 


(1) Examples.—oor—poor, moor 
ure—pure, endure, cure, sure 
ur—curious, spurious, during, security 
ewer— sewer 
our—tour, dour, gourd 


(Note: /u3/ usually occurs in ‘jewel, fluent, truant’.) 


(2) Description.—RP /ua/ glides from a tongue position similar to 
that used for /u/ towards the more open type of /3/ which forms the 
end-point of all three centring diphthongs with, again, a somewhat 
closer variety of [ə] when the diphthong occurs in a closed syllable. The 
lips are weakly rounded at the beginning of the glide, becoming 
neutrally spread as the glide progresses. 

In the same way that the sequence [1] + [9] may constitute a rising 
diphthong, the sequence [u] + [a] may also, in unaccented syllables, 
have the prominence on the 2nd element,! e.g. in influence, valuable, 
vacuum, jaguar, etc., the Ist element often weakening to [w]. In many 
cases of such sequences, [ta] represents a realization of a final 
unaccented /u:/ + morpheme /2/, e.g. rescuer, /'reskju:/ + /9/ > 
['reskj0a].? 

(3) Variants.—/33/ having coalesced with /o:/ for most RP 
speakers, the pattern of centring diphthongs is rendered asymmetrical, 
there being only one back glide of this type opposed to the two front 
glides. As a result, the Ist element of /ua/ can be lowered considerably 


1See Daniel Jones, op. cit. 
2As with /19/ (see §7.12(2)) this unaccented /u:/ may be realized as a closer short 
vowel, thus [reskjua]. 
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without risk of confusion. Thus several words with /ua/, which have a 
pronunciation [us] for some RP speakers, are given by others a glide 
[99], e.g. in poor, sure. This glide [99] may in turn be levelled with the 
realization of /o:/. Thus, Shaw, sure, shore, still pronounced by some 
/for, fua, foa/, are levelled by many others to /fo:/ for all three words; 
or again, you're (most frequently with /ua/) may be realized as /ja:/, 
i.e. identical with your. It is to be noted, however, that such lowering or 
monophthongization of /ua/ is rarer in the case of less commonly used 
monosyllabic words such as dour, gourd. 

Where /j/ precedes /ua/, e.g. in cure, curious, puerile, secure, 
endure, bureau, the glide from close front [j] through back rounded [uv] 
to central half-open [ə] may be reduced not only to /jo:/, as described 
above, but also to a glide from [j] to a long central vowel, i.e. [J9:] or 
[jA:]. This latter pronunciation is a characteristic of upper-class RP. In 
popular London speech, /ua/ levelled with /9:/ in such words as poor, 
sure, may be realized as [9: wa] (see also §7.9.8(3)). In those kinds of 
English in which post-vocalic r is pronounced, RP /uo/ is realized as 
/u(:)/ + /r/ in those words which have an r in the spelling, e.g. poor 
/pu:F/. 

(4) Chief Sources.—PresE /ua/ derives generally from an earlier 
vowel or diphthong + [r], e.g. < ME [o:] or [9:] + [r] (moor, poor, 
boor); < ME [u:] + [r] (mourn, for those RP speakers who use /ua/ 
rather than /d:/)—it is to be noted that most cases of pure vowel + [r] 
have been lowered to /9:/, e.g. fourth, floor, court (see §7.9.8(4)); and 
< ME [iu] or [eu] + [r] or morpheme [or] (sure, pure, fewer). In these 
cases the [a] element developed with the loss of [1] in the eighteenth 
century. Sometimes, however, /ua/ arises from the juxtaposition of 
/u:/ or /u/ and another vowel which has weakened to [a] (influence, 
truant, virtuous, jewel). 

(5) Advice to foreign learners.—Care should be taken to use a Ist 
element of a half-close kind rather than a quality resembling that of 
/u:/. In addition, the spelling r should not be pronounced, except when 
a [r] link is made before a following vowel, either occurring initially in 
the next word, e.g. poor old man /'puar ‘auld 'meen/, or in the following 
syllable of the same word, e.g. tour /tua/ but touring /'tuarin/. 


7.13 Vowels in Relatively Weakly Accented Syllables 


Except in the case of /3/, which occurs normally only in unaccented 
syllables, we have so far dealt mainly with vowels in syllables carrying a 
strong accent (see Chapter 9 for the nature of accent in words).! In each 
polysyllabic word one syllable is said with more stress than the others 


l For a detailed analysis of the relation of vowel qualities and word accentual positions, 
see G. F. Arnold, Stress in English Words (North-Holland Publishing Co., 1957). 
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and has a special prominence, e.g. the second syllable in endeavour or 
the penultimate syllable in examination. This is the tonic or primarily 
accented syllable (marked here with [']). The remaining syllable or 
syllables may be unaccented, or may carry a secondary accent asso- 
ciated with a rhythmic beat or particular vowel qualities (marked here 
with [,]). The following table illustrates the occurrence of vowels in a 
selection of non-tonic situations—in words containing from two to five 
syllables. The first column, Remote Preceding, shows vowels in a place 
more than one syllable removed from the tonic; the second, Adjacent 
Preceding, vowels immediately preceding the tonic; the third, Adjacent 
Following, vowels immediately following the tonic; and the fourth, 
Remote Following, vowels in a place more than one syllable after the 


tonic. 


Remote Adjacent Adjacent Remote 
Preceding Preceding Following Following 
(a) /1/ ina bility effect Sorry apathy 
107 superiority! silhouette ambulance neighbourhood 
ey; consideration allow mother character 
(b) Short 
/e/ referee September prefect ‘architect 
Vey maga zine canteen syntax caravan 
/a/ subjec tivity sul phuric product ‘aqueduct 
/D/ politician Oc'tober diphthong catalogue 
Long 
LVF preconceive aes thetic phoneme obsolete 
Fao artisan! sarcastic placard reservoir 
1907 audi bility au gust (adj.) record corridor 
u7 super sede July neph ew residue 
357 perpen dicular urbane expert ‘universe 
Diphthongs 
7 el/ phrase ology aorta detail magistrate 
Jai/ bio logical minute (adj.)  'missile civilize 
/21/ employee convoy celluloid 
/23U/ photographic November window episode 
/au/ counter act! outrageous compound eiderdown 
/19/ superiority theological : frontier ‘overseer 
/€a/ varia bility where by fanfare ‘underwear 
/ua/ neuro logical curator contour manicure 


It will be seen that totally unaccented syllables are associated par- 
ticularly with vowels of a central or centralized quality (or a syllabic 
consonant), i.e. /9/, /1/, and /u/ (though /u/ in a weak situation 
is normally replaceable by /a/ or may be reduced to /w/ before a 


i In the speech of some people. 
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following vowel). The other vowel phonemes of RP may also, however, 
occur in syllables not carrying the primary accent, but the examples in 
(b) above illustrate tendencies of correlation between degree of accen- 
tuation and vowel quality:— 

(1) The relevant syllables of the examples of Remote Preceding and 
Adjacent Preceding occurrences tend to carry a secondary accent, e.g. 
referee / refa'ri:/, canteen / keen'ti:n/. Alternatively, all the examples 
of Adjacent Preceding occurrences, with the exception of those con- 
taining /z/, /a1/, /91/ and /au/, may be heard pronounced in a totally 
unaccented form with /a/, /1/, or /u/, or, in the case of theological 
with /ja/. 

(2) The relevant syllables of the examples of Remote Following 
occurrences may also carry a secondary accent, of a rhythmical kind, 
especially when the syllable contains a long vowel or diphthong, e.g. 
reservoir /'‘reza,vwa:/, civilize /'siva,laiz/. 

(3) A rhythmic secondary accent may also be associated with the 
final syllable of Adjacent Following words under (b), again especially 
if it is a question in RP of a long vowel or diphthong, e.g record 
/‘reko:d/, detail /‘di:,teil/; if this secondary accent is lacking, as in 
other educated and popular forms of English, the vowel may be elided 
or reduced to /9/ or /1/, e.g. product, placard, record, nephew, missile, 
window, as / prodakt, 'pleekad, rekad, nevi, 'misl, 'winda/. 

(4) With the possible exception of subjectivity (/,sabd3ek tivati/ or 
/ sabd3ik tivati/), all intervening unaccented vowels in the Remote Pre- 
ceding and Remote Following examples in (b) above are /3/ or /1/. 

It may be concluded, therefore, that, especially in the case of words 
in common use, the maintenance of a vowel quality other than /9/, /1/, 
or /u/ in syllables not carrying the tonic accent will often depend upon 
the presence of a secondary degree of accent on the syllable in question. 
Alternatively, it may be said that our feeling for such secondary accents 
derives largely from the quality of the vowels contained in the syllables. 

The present relationship of vowel quality and degree of accentuation 
arises from the various conflicting phonological influences to which 
English has been exposed over the last 1,000 years. As a general rule, 
weak accent in OE led to the obscuration of short vowels and the 
shortening of long vowels. By ME, however, new long vowels or diph- 
thongs under relatively weak accent emerged as a result of vocalization 
of earlier consonantal articulations, e.g. [i1:] < OE [1] + [g] or [j] 
(holy), [ou] deriving from an intrusive [o] or [9] before [y] or [w] 
(follow). Moreover, words of French origin such as empire, increase 
(n.), while shifting their main accent back to the first syllable, kept the 
full vowel quality in the final syllable or, in the case of polysyllables 
such as justify, temporary, retained a secondary accent on a syllable 
following the primary accent. (In words of the temporary, secretary 
type, American English, for instance, keeps this secondary accent on 
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the penultimate syllable with a full [e] vowel, as was the case in English 
up to the eighteenth century, whereas in present RP the secondary 
accent has been lost and the former [e] or [e:] reduced to [ə] or elided.) 
As a result of influences of this kind, full vowels and usually secondary 
accent are found in eModE in a number of situations where to-day 
greater accentual and qualitative weakening is once again the rule, ' e.g. 
certain, bargain, with [ei] in the final syllable in eModE, history, 
majesty, tragedy (and also an English word such as merrily) with [91] 
finally, and emperor, saviour, with [our] finally in eModE. 


7.14 The Frequency of Occurrence of RP Vowels 


According to an investigation by D. B. Fry? of the frequency of vowels 
in colloquial RP, /a/ (10.74 per cent) and /1/ (8.33 per cent) clearly 
emerge as the vowels having the highest count. This is to be expected, 
since /a/ is the most common vowel in unaccented syllables in a 
language which has a high proportion of unaccented syllables, and /1/ 
has a high frequency of occurrence in both accented and unaccented 
syllables. The following is the order of the remaining RP vowels:— 


% % 
/e/ —2:97 /u:/—1:13 
/ai/—1:83 /u/—0:86 
/A/—1:75 /a:/—0:79 
/ei/—1-71 /au/—0:61 
/i: /—1-65 /3./—0:52 
/2Uu/—1:51 /ea/—0°34 
/e/—1:45 /19/—0:21 
/D/—1:37 /21/—0:14 
/9:/—1:24 /ua/—0:06 


Total all vowels: 39-21% 


l For an account of vowels in unaccented syllables in e ModE see E. J. Dobson, English 
Pronunciation 1500-1700, vol. 11, pp. 827-926 (Oxford, 1957). 

2D. B. Fry, ‘The Frequency of Occurrence of Speech Sounds in Southern English,’ 
Archives Néerlandaises de Phonétique Expérimentale, XX (1947), pp. 103-6. 
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The English Consonants 


8.1 The Distinctive Consonants 


It is possible to abstract from a continuous utterance of English by 
means of a process of commutation (see $5.3) twenty-four distinctive 
units which are consonantal both in terms of their function (1.e. they 
tend to be non-central or marginal in the syllable—see §5.5) and also, in 
the majority of cases, in terms of their phonetic nature (i.e. they have, 
at least in some of their realizations, articulations involving the obstruc- 
tions or narrowings which produce, acoustically, a noise component 
—see 94.2). 

These twenty-four consonantal phonemes are classified in the table 
below in two general categories: — 

A. Those articulations in which there is a total closure or a stricture 
causing friction, both groups being typically associated with a noise 
component (obstruents); in this class there is a distinctive opposition 
between fortis and lenis types. (For the reasons given in $4.3.6—1.e. 
that the distinctive voicing feature is not always operative in the pairs 
/p, b; t,d; k,g; tf, d3; f, v; 8,0; s, z; f, 3/—it seems preferable to use the 
more generally valid labels fortis and lenis to categorize the two sets 
rather than ‘voiceless’ and ‘voiced’). 

B. Those articulations in which there is only a partial closure or an 
unimpeded oral or nasal escape of air; such articulations, typically 
voiced and frequently frictionless (sonorants, without a noise compo- 
nent) may share many phonetic characteristics with vowels. 


= ke X 
TES = = $3 2 MA 
a a a =< EX ct A Se 
A. Plosive p, b td k,g 
Affricate (tr, dr) tf, d3 
Fricative (a) E OE Oo} ese ilies h 
B. Nasal m n 0 
Lateral l 


Approximant ' w r j 


lor frictionless continuant or glide (semi-vowel). 
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Note 


(1) In some types of RP it may be necessary to include the labial-velar 
fortis voiceless fricative [mM] as a phoneme. 

(2) In practical teaching it may also be convenient to treat /tr/ and 
/dr/ as distinctive affricates as well as /tf/ and /d3/ (see §8.3). 

(3) The glottal stop [?] has been excluded, since it is not phone- 
mically distinctive in RP; its use as a reinforcement for vowels and its 
allophonic association with /p, t, k/ will be treated in §8.2.7. 

It will be seen from the arrangement above that:— 

(a) the plosive and nasal phonemes fall into three contrastive groups 
as far as the place of articulation is concerned, i.e. bilabial, alveolar, 
and velar; 

(6) the affricate, lateral, and /r/ phonemes have an alveolar basis; 

(c) the fricatives have five areas of articulation, i.e. labio-dental, 
dental, alveolar, palato-alveolar, and glottal. 

These basic areas of articulation, convenient for labelling the 
phoneme, may be extended in the various allophonic realizations of the 
phoneme, but for any particular situation in a context the number of 
oppositions involving the place of articulation will remain unchanged; 
thus, the allophones of /t/ may be dental or post-alveolar, or the 
allophones of /k/ palatal, without constituting additional distinctive 
areas of articulation, since such variants are conditioned by the context. 


Class A 
Consonants Involving Closure or Stricture 
(Obstruents) 


8.2 Plosives 


The complete articulation of a pulmonic egressive plosive, or stop, 
consonant consists of three stages: — 

(1) the approach or closing stage, during which the articulating 
organs move together in order to form the obstruction; in this stage, 
there is often an on-glide or transition audible in a preceding sound 
segment and visible in an acoustic analysis as a characteristic curve of 
the formants of the preceding sound: 

(2) the hold or compression stage, during which lung action com- 
presses the air behind the closure; this stage may or may not be accom- 
panied by voice, i.e. vibration of the vocal folds; 

(3) the release or explosion stage, during which the organs forming 
the obstruction part rapidly, allowing the compressed air to escape 
abruptly; if stage (2) is voiced, the vocal fold vibration may continue in 
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stage (3); if stage (2) is voiceless, stage (3) may also be voiceless 
(aspiration) before silence or before the onset of voice (as for a follow- 
ing vowel), or stage (3) may coincide with the onset of vocal fold 
vibration, as when a voiceless plosive is followed without intervening 
aspiration by a vowel; again, an off-glide or transition associates the 
plosive with a following sound. 

Since a condition of plosive articulation is that the whole of the 
speech tract behind the primary closure should form a chamber sealed 
to the escape of air, and since the primary closures for the English 
plosives are normally made in the oral cavity, the soft palate must be 
held in its raised position for at least the first two stages of these 
articulations. ! 


8.2.1 The Significant Phonetic Features of English Plosives 


The RP plosive phonemes comprise three pairs: /p, b/; /t, d/; /k, g/. 
The following words illustrate oppositions in word initial, medial, and 
final positions: — 


/p/ /b/ /t/ /d/ IRI /9/ 

Initial pole bowl toll dole coal goal 
Medial riper — writer rider — — 

— — bitter bidder bieker bigger 

caper eaber eater — — — 

— rubber — rudder — rugger 

lopping lobbing — — loeking logging 
Final rip rib writ rid riek rig 


These oppositions may be realized by means of one or several of the 
following phonetic features:— 

(1) Place of articulation.?—/p, b/, bilabial; /t, d/, alveolar; /k, g/, 
velar. 

(2) Force of articulation.—/p, t, k/ tend to be pronounced with 
more muscular energy and a stronger breath effort than /b, d, g/; the 
former are known as relatively strong or fortis, the latter as relatively 
weak or /enis. (See §8.5 for fortis/lenis as phonological categories.) 

(3) Aspiration.—The fortis series /p, t, k/, when initial in an 
accented syllable, are usually accompanied by aspiration, i.e. there is a 
voiceless interval consisting of strongly expelled breath between the 


! When a nasal consonant precedes a plosive, the soft palate is raised in the first stage of 
the plosive. i.e. there is a nasal approach. 

i.e. the allophones of these phonemes are usually bilabial, alveolar and velar 
respectively. 
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release of the plosive and the onset of a following vowel, e.g. pin, tin, 
kin [p"tn, 'tIn, ‘k"In]. (The feature of aspiration is commonly regarded 
from an acoustic point of view as the voiceless interval occurring 
between the release burst of the stop and the onset of the voicing of the 
following sound, the time measured, which may be of the order of 
40—70 msecs., being referred to as the voice onset time—VOT: see, for 
instance, Abramson, 1977.) When /1, r, w, j/ follow /p, t, k/ in such 
positions, the aspiration is manifested in the devoicing of /l, r, w, j/, 
e.g. in please, pray, try, clean, twice, quick, pew, tune, queue; some 
devoicing may also occur in relatively unaccented situations, e.g. 
apricot, atlas, applicant, heckler, buckram, vacuum, etc. In other posi- 
tions, i.e. preceding a vowel in an unaccented syllable and finally, such 
aspiration as may occur is relatively weak, e.g. /p/ in polite, lip. When 
/s/ precedes /p, t, K/ initially in a syllable, there is practically no aspira- 
tion, even when the syllable carries a strong accent, cf. pin [prin] and 
spin [spin] (see §5.3.5).! In final positions, i.e. preceding silence, /p, t, 
k/ may have no audible release (see §8.2.3(1)). The lenis series /b, d, g/ 
is not normally aspirated (see, however, the next section). 

(4) Voicing.—The lenis series /b, d, g/ may have full voice during 
their second stage when they occur in positions between voiced sounds, 
e.g. in labour, leader, eager, rub out, read it, egg and. . ., to be, to 
do, to go. In initial and especially in final positions, i.e. following 
or preceding silence, /b, d, g/, while remaining lenis, may be only 
partially voiced or completely voiceless, e.g. in bill, done, game, cub, 
lid, bag, /b, d, g/ may be realized, initially, with vocal fold vibration 
beginning only in the last portion of the second stage and, finally, as 
[b, d, g]. It is, therefore, unusual in normal speech for the release stage 
of /b, d, g/ in final positions to be accompanied by a voiced off-glide 
[9]; with complete devoicing, any audible release will be accompanied 
by weak aspiration. The fortis series /p, t, k/ is not voiced. 

The parametric diagrams (below) provide a schematic representation 
of the timing of vocal fold vibration in different phonetic contexts. 

(5) Length of preceding sounds.—When the RP plosives occur 
finally in a syllable, their value is determined largely (since the voicing 
factor is not strongly operative) by the length of the syllable which they 
close. It is a feature of RP that syllables closed by fortis consonants are 
considerably shorter than those which are open, or closed by a lenis 


1 Accented /p, t, k/ also show some loss of aspiration as a result of a preceding /s/ 
which may be regarded as not in the same syllable, e.g. /k/ in discussed is only very 
weakly aspirated compared with the /k/ of custard, so that discussed may be distin- 
guished from disgust only by the fortis nature of /k/. Other pre-stop consonants (espe- 
cially fortis fricatives) appear to exert a certain absorption-effect upon the aspiration of 
initial /p, t, k/ in a following syllable or word—compare the degree of aspiration of the 
accented fortis plosives in half-past, push past, and go past; brief talk, finish talking, and 
no talking; rough coat, fresh coat, and two coats. 
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devoiced [d] 
fully voiced [d] 





partially voiced [d] (negative V.O.T.) 
unaspirated [t7] (zero V.O.T.) 





aspirated [t"] (positive V.O.T.) 





===-yoice Onset time (V.O.T.) (measured from the 

release of the stop to the onset of voicing) 
release — vocal folds not vibrating 
of stop mam Vocal folds vibrating 


consonant. We have Seen in the chapter on vowels that this variation of 
length is particularly noticeable when the syllable contains a ‘long’ 
vowel or diphthong, cf. the fully long vowels or diphthongs in robe, 
heard, league (closed by lenis /b, d, g/) with the reduced values in rope, 
hurt, leak (closed by fortis /p, t, k/). Preceding consonants, notably 
/1, n, m/, are also shortened by a following /p, t/ or /k/, especially 
when the consonants are themselves preceded by a short vowel, e.g. 
compare the relatively long /1/ in killed, Elbe, /n/ in wand, and /m/ in 
symbol with the reduced varieties in kilt, help, want, simple. A 
phonemic transcription of rope, robe, as /raup, raub/ is, therefore, to 
be interpreted as indicating that the words are distinguished not only or 
even primarily by a difference of the final consonant, but rather by a 
complex of quantitative and qualitative contrasts extending over the 
greater part of the word. The same effect of reduction also operates 
when /p, t, k/ occur medially in a word, cf. the length of /a1/ in rider, 
writer, although in this situation a more strongly contrastive voicing 
feature Is likely to be present in /b, d, g/. This reduction in the length of 
a vowel or a sonorant consonant before fortis consonants 1s sometimes 
known as pre-fortis clipping. 

Summary.—The RP plosives may, therefore, be said to be 
distinguished: — 

(a) by means of a three-term series in respect of place of articula- 
tion—bilabial v. alveolar v. velar; 

(b) at each point of articulation by the following phonetic features 
according to the situation (taking the bilabial oppositions as examples 
which are valid for the other two places of articulation):— 


Initial in accented syllable—/p/ voiceless fortis aspirated v. /b/ 
partially voiced or voiceless lenis unaspirated, e.g. pole v. bowl. 

Medial, following accented syllable—/p/ voiceless fortis weakly 
aspirated v. /b/ voiced lenis unaspirated, e.g. rapid v. rabid. 

Finally—/p/ voiceless fortis weakly aspirated, if released, 
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reducing length of preceding sounds v. /b/ voiceless (or voiced only 
at the beginning of the compression stage) lenis weakly aspirated, if 
released, preceding sounds retaining relatively full length, e.g. rope 
v. robe. 


It is clear that, initially in accented syllables, /p, t, k/ and /b, d, g/ 
are distinguished by the listener mainly through the presence or absence 
of aspiration, rather than through the presence or absence of voice. 

Foreign learners are, therefore, advised to pay particular attention to 
the aspiration of /p, t, k/ when these phonemes occur initially in an 
accented syllable. If a word such as pin is pronounced [pin], instead of 
[p"in], there is the danger that the English listener may understand bin, 
since he interprets lack of aspiration as a mark of the lenis /b/. The 
danger is particularly great for speakers of those languages, e.g. many 
in the Romance and Slav groups, where the opposition between lenis 
and fortis stops relies purely upon presence or absence of voice. (The 
aspiration distinction between /p, t, k/ and /b, d, g/ should also be 
retained, when /p, t, k/ are followed by /l, r, j, w/, by the devoicing of 
these latter, e.g. compare plight, try, crate, tune, twelve, with blight, 
dry, great, dune, dwell.) Such speakers should also avoid excessive 
voicing of the lenis series /b, d, g/, especially in final positions. On the 
other hand, speakers of some languages, e.g. of the Germanic type, 
tend to neutralize the /p/—/b/ kind of opposition in final positions, 
using a fortis plosive for both. It must be remembered that /b, d, g/ 
finally, though they may not be voiced, remain weak as compared with 
/p, t, k/, the preceding sounds retaining full length. 


8.2.2 Acoustic Features of English Plosives 


Perceptual cues, capable of being expressed in acoustic terms, may be 
provided by all three stages of plosive articulations, so that it is possible 
to distinguish: (1) plosives from other consonants, (2) /p, t, k/ from 
/b, d, g/, (3) the bilabial, alveolar, and velar types. 

(1) Plosives differ from other consonants mainly in the stage cor- 
responding to the articulatory ‘hold’. This part of the consonant is 
generally characterized acoustically by a perceptible period of silence 
throughout the whole spectrum or, in the voiced /b, d, g/, an absence 
of energy except at a low frequency. 

(2) /b, d, g/ may be distinguished from /p, t, k/ by means of a low 
frequency component present in the former, 1.e. voice. Moreover, there 
is likely to be a marked rising bend of Fl of the adjacent vowel in the 
case of /b, d, g/, which is not as marked in the case of /p, t, k/. How- 
ever, as we have seen, /b, d, g/ may often be voiceless, in which case 
they are distinguished from /p, t, k/—initially, by the comparatively 
weak burst of noise associated with the release stage and by the absence 
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of the gap (aspiration) characterizing /p, t, k/; finally, by their 
influence on the duration of the preceding sounds; medially, by the 
longer closure period (absence of energy) required for /p, t, k/.! 

(3) Cues to distinction as between the bilabial, alveolar, and velar 
plosives are provided, in some measure, by the noise burst related to 
their release stage. (Effective recognition is stimulated if the noise has a 
frequency span of some 480 cps and a duration of 15 msecs.) The actual 
situation of the burst of noise for each plosive category may, however, 
depend on the nature of the following sound. Thus, though the 
alveolars have a typical noise burst in the range of 3,000 to 4,000 cps, 
the bilabials may be characterized by a noise burst in the region of 360 
cps in the vicinity of all vowels, but considerably higher frequency 
bursts (e.g. in the region of 2,000 cps) may produce an impression of a 
bilabial plosive when in the vicinity of the closer front and back vowels. 
Again, the situation of the release burst recognized as velar will depend 
very much upon the nature of the following vowel. It has been found 
that, before /u:/, velar recognition is achieved with a burst in the 
region of 700 cps; the frequency of the characteristic burst seems to rise 
throughout the series /u:/-/a:/-/i:/, following the rising frequency of 
F2 of the vowels, and reaches about 3,000 cps for a vowel of the type 
Ce], although in the case of the front vowels there is a second burst 
stimulus possible at considerably lower frequencies (matching to some 
extent the varying Fl of the vowels). The burst cues for velar plosives 
tend to be the inverse of those for bilabials in the range 300—2,000 cps. 

A particular curvature (or transition) of F2 of the adjacent vowel also 
provides a primary cue for the distinction between the three types of 
English plosive articulation. Thus, in the case of the bilabials, F2 of a 
following vowel will typically curve upwards to its steady state and that 
of a preceding vowel downwards from its steady state (a minus transi- 
tion); the alveolars exhibit small minus transitions in F2 of an adjacent 
/1:/ vowel and large plus transitions in F2 with an adjacent /u:/; the 
characteristic transitions associated with the velars seem to be largely 
the inverse of those for the alveolars, F2 having plus transitions but the 
greatest plus transition being related to an open vowel of the /a:/ type. 
Such considerable variation in cues for velar plosives, both in the 
release burst and in the transitions, would appear to reflect the exten- 
sive articulatory variation associated with the English velar plosives (see 
§8.2.6). 

Any plosive + vowel or vowel + plosive sequence is, therefore, 
characterized by the curving of F2 of the vowel in a typical direction, as 


! See Leigh Lisker, ‘Closure Duration and the Intervocalic Voice-Voiceless Distinction 
in English,’ Language, vol. 33, no. 1, 1957, in which it is shown that intervocalic /b/ has 
an average duration of 75 msecs compared with an average of 120 msecs for an inter- 
vocalic /p/. 
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though from, or to, a fixed point in the spectrum known as the ‘locus’. 
This point is defined experimentally when several vowels are employed. 
The transition does not, however, extend from the steady state to the 
locus, but merely points in the direction of the latter. It would appear 
from experiments with synthesized speech that the best recognition is 
achieved if the first half of the transition from plosive to vowel consists 
of silence; if the curve is-extended too far, glides of the type /], w/ may 
be perceived. Research suggests that the transition of F2 and of F3 
adjacent to the vowel is related to the place of articulation, whereas the 
nature of Fl is related to the manner of articulation.! 


8.2.3 The Release Stage of English Plosives 


It is not always the case that plosives in English have a third stage’ con- 
sisting of a sudden oral release of air, either in the form of aspiration or 
as an immediately following vowel. The main variants are: — 

(1) No audible release in final positions.—In final positions, as in 
map, mat, mack, or robe, road, rogue, the closure stage may be main- 
tained, the air compression becoming weak and the release being 
achieved by a gentle, delayed, and relatively inaudible opening of the 
oral closure; or the compressed air may be released nasally and rela- 
tively inaudibly by lowering the soft palate and delaying the separation 
of the organs forming the oral closure. When an audible third stage is 
missing, the plosive is sometimes termed ‘incomplete’. The absence of 
an audible release stage entails the loss of the release noise burst as acue 
to the identification of the plosive. Unreleased final bilabial, alveolar, 
and velar plosives will, therefore, be distinguished mainly by the 
transitional features of the preceding sound. The sensitivity of English 
listeners to such cues is proved by the high percentage of correct 
discrimination between such pairs as mat, mack, or road, rogue, 
presented without a context, even when the final plosive is not released. 
The fortis series /p, t, k/ will, of course, be distinguished in final 
positions from the lenis series /b, d, g/ either by the reduction of length 
of the sounds preceding /p, t, k/ or by the presence of some voicing 
in /b, d, g/, or by a combination of both factors. The non-release of 
final plosives is a feature of colloquial RP. Careful speakers, how- 
ever, tend to release such plosives audibly and those who, in ordinary 


Il For details of research in this field see publications listed in the Bibliography under the 
names of: Delattre, Libermann, Copper, Lisker, and Harris; Joos; Potter, Kopp, and 
Green; Halle, Hughes and Radley; Fry and Denes; Borden and Harris. 

2Some writers (notably G. F. Arnold in ‘Concerning the theory of plosives,’ Le Maitre 
Phonétique, no. 125, 1966) argue that this stage should be separated into a ‘release’ (an 
articulatory feature which is the converse of the ‘closure’) and a ‘plosion’ (an auditory 
feature). Such an additional distinction can be of value in describing the special cases 
considered in paragraphs (1)—(5) of this section. 
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conversational style, use the unexploded variety will often use an 
audible release in more formal circumstances. (See §11.6 on stylistic 
variation.) 

(2) No audible release in stop clusters.—It 1s also a feature of most 
kinds of English that in a cluster of two stops (plosives or plosive + 
affricate) either within a word or at word boundaries, the first plosive 
has no audible release, e.g. in dropped (/p/ + /t/), rubbed (/b/ + 
/d/), white post (/t/ + /p/), good boy (/d/ + /b/), locked (/k/ + 
/t/), big boy (/g/ + /b/), object (/b/ + /d3/), great joke (/t/ + /d3/), 
big chin (/g/ + /tf/). In those languages where plosives in such 
situations are released audibly, the result is an intervening [h] in the case 
of voiceless plosives and an obscure vowel of the [a] type in the case of 
voiced plosives. In English the closure for the second stop is made 
before the release of the first, forming a further obstacle to the air- 
stream if the second closure is at a more advanced point, e.g. /t/ + /p/ 
in white post, or checking the air pressure if the second closure is at a 
more retracted point, e.g. /t/ + /k/ in white cat. Release of the first 
plosive is also delayed in cases of gemination, e.g. top people (/p/ + 
/p/), good dog (/d/ + /d/), big girl (/g/ + /g/), the second (hold) 
stage, consisting of silence or voice, being prolonged. The same delay in 
the release occurs when plosives, which are homorganic but different in 
fortis-lenis terms, are in sequence, e.g. top boy (/p/ + /b/), white dog 
(/t/ + /d/), big car(/g/ + /k/); in these cases, cues to recognition may 
be provided by the onset or cessation of voice, by the aspiration charac- 
teristic of the fortis series in syllable initial positions, and by the 
influence on the duration of preceding sounds of syllable final lenis 
plosives. It should also be noted that, in addition to the omission of an 
audible third stage of the first plosive in clusters, the first stage (on- 
glide, transition) of the following stop is also inaudible. Thus, in 
sequences of three plosives,! e.g. wept bitterly (/p/ + /t/ + /b/), 
locked door (/k/ + /t/ + /d/), jogged by (/g/ + /d/ + /b/), the 
central plosive has no audible first or third stage; when this position is 
occupied by /p, t, k/, the plosive is manifested only by a silence of a 
certain duration, i.e. the length of its second stage. It will be seen that 
the past tense termination /t/, following /k/ or /p/ and not released in 
a final position, as in worked, slipped, may provide no audible clue as 
to the tense; to a lesser extent, this is also true of /d/ following /b/ or 
/g/, as in robbed, tugged. In such cases, the verbal tense is supplied by 
the context, 

(3) Glottal reinforcement? of final /p, t, k/. It is increasingly typical 
of many types of British English that final /p, t, k/, in such words as 


l1.e. when the alveolar plosive is articulated in such sequences. (See §11.3.6.) 
¿The term ‘reinforcement’ is used by J. D. O’Connor in ‘RP and the reinforcing glottal 
stop’, English Studies, xxxiii, 1952. 
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shop, shot, shock, have the oral closure reinforced by a glottal closure 
[7]. In some cases this glottal coincides in time with the oral closure, 
inhibiting much of the air-pressure behind the oral closure, whether or 
not this latter is released audibly; in others the glottal closure may 
slightly anticipate the articulation of the oral obstruction; in other, 
rarer, cases there may be some compression of the air between the 
glottal and oral closures by means of the raising of the larynx and a con- 
striction of the pharyngeal cavity, resulting in a potential ejective 
release. In such a case the plosive is no longer glottally reinforced or 
glottalized but is instead produced using the egressive glottalic (or 
pharyngeal) airstream mechanism. In certain cases, too, [?] may replace 
poet, k7, see §8.2.7. 

(4) Nasal release.—When a plosive is followed by the homorganic 
nasal consonant, either syllabic or initial in a following syllable, the 
release of air is normally effected not by a removal of the oral closure, 
which is retained, but by the escape of the compressed air through the 
nasal passage, opened by the lowering of the soft palate for the nasal 
consonant, e.g. /p/ + /m/ topmost, /p/ + [m] sometimes in happen 
[heepm], /b/ + /m/ submerge, /b/ + [m] sometimes in ribbon 
[nbm], /t/ + /n/ chutney, /t/ + [n] cotton, /d/ + /n/ madness, /d/ 
+ [n] sudden; and, more rarely, /k/ + /9] thicken ['91kn], /g/ + [D] 
organ ['9: gn]. The same release takes place when the plosive and 
homorganic nasal occur at word boundaries, e.g. cheap meat, robe 
mistress, not now, red nose, etc. (Since /n/ does not occur initially in 
syllables, this last generalization does not apply to /k/ and /g/.) 

A different kind of nasal release occurs when the nasal consonant 
following a plosive is not homorganic, e.g. in cheap nuts, rub now, 
nutmeg, bad man, black magic, big nose, big man, etc. In these cases 
the plosive closure is not normally released until the articulatory move- 
ments for the nasal consonant, 1.e. the second oral closure and the 
lowering of the soft palate, have been accomplished. 

(5) Lateral release. —The most frequent tongue contact for English 
/1/ being alveolar, the sequences /t/ or /d/ + /l/ are homorganic (i.e. 
made at the same place of articulation). /t/ and /d/ in such situations 
are normally released laterally, 1.e. one or both sides of the tongue are 
lowered to allow the air to escape, the tongue-tip contact remaining. 
Such a release occurs whether the following /1/ is syllabic, e.g. in cattle, 
medal, or if it is initial in the next syllable or word, e.g. in atlas, at last, 
regardless, bad light. Such homorganic lateral release is to be distin- 
guished from sequences of /p, b, k, g/ + /l/, e.g. in apple, up late, 
bubble, blow, rub lightly, tackle, clean, blackleg, glow, eagle, big lad. 
In these cases, the partial alveolar contact for /1/ is made before or at 
the time of the release of the plosive and, in this sense, the escape of air 
is lateral; but since /p, b/ and /k, g/ may be released in a truly lateral 
way, 1.e. by the removal of one or both sides of the bilabial or velar 
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closure, the term ‘lateral release’ is best reserved in English for the 
homorganic alveolar + /l/ sequences. Such true lateral releases must 
be taken as typical of English usage, there being no intervening removal 
of the tongue contact on the alveolar ridge, such as would result in 
aspiration or an obscure vowel. Pronunciations of the type ['lit"#, 
(‘mid*t] for little, middle, are frequently to be heard in the speech of 
children. 

(6) Affrication of plosives.—If the release of plosive closures is not 
made rapidly, a fricative sound, articulated in the same area of articula- 
tion as the plosive, witl be heard; plosives made with this slow, fricative 
release are said to be affricated. Common realizations of the English 
plosives /p, b, t, d, k, g/ might, therefore, be followed by brief frica- 
tives Of the types [0, fi, s, Z, x, Y]. In some varieties of English the 
alveolars /t, d/ may frequently be heard in affricated form [t*, d7]: in 
strongly accented positions, e.g. in time, day; in relatively weakly 
accented positions, e.g. in waiting, riding; and in final positions, e.g. in 
hat, bed. (Note that, in these last two examples, the forms [t*] and [d?] 
differ from the realization of the plural terminations /t/ + /s/ and /d/ 
+ /z/ mainly in the brevity of the friction associated with the affricated 
plosives.) Affrication is also occasionally heard with the velar plosives, 
i.e. [k*] and [g], e.g. in hesitant or emphatic speech in accented 
situations in such words as come, good, or, more commonly with /k/, 
in weakly accented or final positions, e.g. in talker, talk. /p/ and /b/ 
are rarely affricated. Affrication of plosives, especially of the alveolars, 
is particularly characteristic of popular London speech. 

It should also be noted that in rapid, familiar speech, where easy 
intelligibility rather than articulatory precision is the aim, the closure, 
of plosives is often so weak that the corresponding fricative sound, 
without a preceding stop, is produced, especially in weakly accented 
intervocalic positions. The following examples have been noted among 
educated speakers; imported [im'p>:sid], invaded [in'veizid], baker 
['beixa], dagger [‘deeya] (this latter, on the stage, in the Macbeth 
‘dagger’ soliloquy), and even pepper [‘peda], rubber ['raBa]. 

(7) Advice to foreign learners.—All the foregoing variants of the 
hold and release stages of English plosives may be heard from RP 
speakers. A foreign speaker of English may be generally intelligible 
without adopting any of these features, such is the redundancy of 
information carried in the English utterance. But the foreign learner 
who aims at a near approximation to the speech of English natives 
should adopt the following features at least: — 

(a) Inaudible release of plosives preceding other plosives or 
affricates. l 

(b) Nasal release of plosives followed by a homorganic nasal, 
especially /t, d/ + /n/. 

(c) Lateral release of /t, d/ + /I/. 
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(d) Affrication of /p, t, K/ as a stage in learning aspiration of these 
plosives in strongly accented positions. 


8.2.4 Bilabial Plosives 
/p, b/ 
(1) Examples 


/p/—fortis (regularly spelt with p; note ‘hiccough’ /‘hikap/, and silent 
pin ‘pneumonia, psalm, ptarmigan, receipt, cupboard’, etc.) 
accented, aspirated—pin, pill, pain, appear, impatient; play, pray, 
pew 

accented after /s/, unaspirated—spin, spill, Spain, spear; splay, 
spray, spew 

weakly accented, relatively unaspirated—upper, capable, oppor- 
tunity, gospel; simply, apricot, champion 

syllable final—cheap, lip, lap, shape, lisp, pulp, pump; upright, 
chaplain, upward 

with no audible release—captain, topcoat, wiped, hop picker, top 
boy, top girl, top dog, ripe cheese 

followed by nasal consonant—topmost, halfpenny, happen, cheap 
meat 

followed by lateral consonant—apple, couple, please, up late 

/b/—lenis (regularly spelt with b; note silent bin ‘limb, thumb, comb’, 

etc., and ‘debt, subtle, doubt’) 

initial, partially devoiced—big, boast, banana, begin; blow, brain, 
beauty 

intervocalic,' voiced—rubber, labour, harbour, husband, symbol 

final, voiceless—rib, ebb, sob, robe, bulb 

with no audible release—obtain, rubbed, subconscious, sob bitterly, 
sub-prefect, Bob goes, object 

followed by nasal consonant—submerge, robe mistress, ribbon 

followed by lateral consonant—bubble, blow, rub lightly 

Compare /p/, /b/—post, boast; peach, beach; rapid, rabid; dapple, 

dabble; sopping, sobbing; simple, symbol; cup, cub; rope, robe; 
plead, bleed; pray, bray; puke, rebuke; mopped, mobbed. 


(2) Description.—The soft palate being raised and the nasal reso- 
nator shut off, the primary obstacle to the air stream is provided by the 
closure of the lips. Lung air is compressed behind this closure, during 
which stage the vocal folds are held wide apart for /p/, but may vibrate 
for all or part of the compression stage for /b/ according to its situation 
in the utterance. The air escapes with force when the lip closure is 


li.e. in the sense of ‘between voiced sounds’. 
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FIG. 28.—/p, b/. 


released, unless the air-stream has been blocked by a second closure at a 
point behind the lips (as for /t/) or has been diverted through the nose 
by the lowering of the soft palate (as for /m/); when a lateral sound 
follows, the air-stream will have a lateral escape round the point of 
alveolar closure. In those cases where a bilabial plosive precedes a labio- 
dental sound (/f, v/), as in cup-full, obvious, the stop 1s often made by 
a labio-dental rather than a bilabial closure, in anticipation of the 
following fricative articulation, thus [kapful], [obvias]. Tongue move- 
ments involved in vowels or consonants adjacent to the bilabial stop are 
made independently of the lip closure, e.g. the /9:/ tongue position is 
maintained through the /b/ closure in four balls and the /l/ alveolar 
contact through the /p/ closure in helpless. 

(3) Variants.—No important variants of /b/ occur, except in respect 
of the amount of voicing in initial and final positions, full voicing in 
either position being rare. On the other hand, many speakers tend to 
devoice in intervocalic positions, retaining the lenis nature of the con- 
sonant. In the same way, the amount of aspiration given to /p/ varies 
between speakers, though the accented form will always tend to be 
more strongly aspirated than the unaccented form (see $$8.2.1, 
8:23). 

(4) Chief Sources. —PresE /p/ and /b/ develop regularly from the 
same OE phonemes (single and geminated) and from French /p/ and 
/b/. In some cases /b/ derives from earlier /p/, e.g. lobster, pebble, 
and /p/ from earlier /b/, e.g. pudding, purse, gossip. 

(5) Advice to foreign learners. —See general remarks in $$8.2.1, 
8.2.3, and examples for practice in (1) of this section. 
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8.2.5 Alveolar Plosives 


tad 
(1) Examples 


/t/—fortis (regularly spelt ¢, tf; sometimes with th, e.g. ‘Thames, 
Thomas’; also -ed in verbal past tenses and participles after fortis 
consonants other than /t/, e.g. ‘jumped, looked, laughed, 
guessed, pushed’; ¢ silent in ‘castle, Christmas’, etc.) 

accented, aspirated—take, tall, tone, attend, obtain; try, between, 
tune 
accented after /s/, unaspirated—steak, stall, stone 
weakly accented, relatively unaspirated—butter, letter, after, tax- 
ation, phonetic; entry, antler, outward 
syllable final—beat, boat, late, past, sent, halt, tuft, rushed, act, 
fetched 
with no audible release—outpost, hatpin, football, catgut, white tie, 
that dog, white chalk, great joke 
with homorganic nasal release—cotton, button, eaten, not now 
followed by /m/—nutmeg, utmost, that man! 
with homorganic lateral release—little, cattle, atlas, at least 
/d/—lenis (regularly spelt d, dd) 
initial, partially devoiced—do, dog, double, date; dry, dwindle, duke 
intervocalic, voiced—leader, order, adorn, hiding, London, elder, 
under, middle, sundry, fiddler, endways 
final, voiceless—bid, mad, road, rubbed, bend, old, loved, bathed, 
raised, judged 
with no audible release—head boy, head girl, bad pain, red car, good 
dog, bed time, good judge, good cheese 
with homorganic nasal release—sudden, madness, red nose 
followed by /m/—admit, road map 
with homorganic lateral release—middle, padlock, headless, badly, 
good luck 
Compare /t/, /d/—town, down; latter, ladder; water, warder; 
written, ridden; metal, medal; fated, faded; sat, sad; wrote, road; 
kilt, killed; bent, bend; train, drain; twin, dwindle; tune, dune 
/t/, /6/—tin, thin; taught, thought; eater, ether; fort, fourth; tent, 
tenth; welt, wealth 
/d/, /0/—dough, though; day, they; den, then; udder, other; 
loading, loathing; breed, breathe; side, scythe 


(2) Description.—The soft palate being raised and the nasal reso- 


11f the alveolar plosive is articulated as such. See §§8.2.7(2)(b)(3), 11.3.5. 
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Fic. 29.—/t, d/. 


nator shut off, the primary obstacle to the air-stream is formed by a 
closure made between the tip and rims of the tongue and the upper 
alveolar ridge and side teeth. Lung air is compressed behind this 
closure, during which stage the vocal folds are wide apart for /t/, but 
may vibrate for all or part of the compression stage for /d/ according to 
its situation in the utterance. The lip position for /t/ and /d/ will be 
conditioned by that of the adjacent sounds, especially that of a follow- 
ing vowel or semi-vowel, e.g. spread lips for /t/ in feeth, anticipatory 
lip rounding for /t/ in tooth, twice. The air escapes with force upon the 
sudden separation of the alveolar closure, unless the airstream has been 
blocked by a second closure either behind the alveolars (as for /k/) or 
forward of the alveolars (as for /p/), or unless it has been diverted 
through the nose by the lowering of the soft palate (as for /n/); if the 
release is lateral, only part of the alveolar obstruction is removed, the 
tongue-tip contact remaining. 

The alveolar stop contact is particularly sensitive to the influence of 
the place of articulation of a following consonant. Thus, followed by 
/r/ as in try, dry, the contact will be post-alveolar ft, d]; followed by 
/8, 0/ as in eighth, not that, dental [t, d]; followed by /f/ as in cheap, 
reach, accompanied by a considerable anticipatory raising of the front 
of the tongue. In addition, word final /t, d/ assimilate readily to /p, k/ 
and /b, g/, when followed by word initial bilabial and velar consonants 
(see SIL 34) 

(3) Variants.—In addition to the general plosive variations com- 
mented on in §§8.2.1, 8.2.3, it should be noted that /t, d/ are especially 
liable to affrication (particularly in the south of England) and even 
replacement by the equivalent fricative in weakly accented situations, 
e.g. time [t'am], important [im'ps:tont]. In Irish English, /t, d/ may 
be strongly dentalized and affricated before /r/. For some speakers, 
and generally in American English, /t/ is realized in weakly accented 
intervocalic positions as a lenis, rapid tap resembling a /d/ or one tap 
[c], e.g. butter, latter, put it over there. Increasingly, /t/ in syllable final 
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positions is reinforced or replaced by a glottal closure unless a vowel or 
syllabic /n/ or /1/ follows, e.g. late, want, cricket, outright, chutney. 
Some RP speakers will also use [?] to replace /t/ when [n] follows, 
e.g. cotton, certain; but the use of [?] for /t/ preceding [1], and more 
particularly in unaccented intervocalic positions, is typical of regional 
varieties of English (e.g. those of popular London and Glasgow), as in 
kettle, butter, later, a lot of (see §8.2.7 for [?]). 

The instability of alveolar articulations is further demonstrated by 
the ease with which /t/ or /d/ may be elided in consonantal clusters (see 
§9.6 for examples within words and §11.3.6 for examples at word 
boundaries). 

(4) Chief sources.—PresE /t/ and /d/ derive from the same 
phonemes of OE (single and geminated) and from French introductions 
with /t/ and /d/. Some words, now spelt with th and pronounced with 
/8/, had /t/ until eModE, e.g. throne, orthography, diphthong, 
authority. There are, in addition, many cases in PresE of established 
elision of an earlier /t/ or /d/, e.g. in castle, hasten, Christmas, often, 
fasten, Hertfordshire, Wednesday, handsome, in which words the /t/ 
or /d/ was probably sounded up to the seventeenth century. In other 
cases /t, d/ have been added to earlier French stems ending in /n/, e.g. 
peasant, parchment, sound, astound, and to certain English and 
French words after a fortis fricative, e.g. against, amongst; graft. It 
should also be noted that the past tense and participle termination -ed 
(earlier [əd]) of weak verbs assimilated to /t/ following fortis 
consonants—other than /t/—on the loss of the intervening weak 
vowel, e.g. wrapped, missed, annexed, pierced, blessed, etc. 

(5) Advice to foreign learners.—In addition to the general remarks 
in §§8.2.1, 8.2.3, and the examples for practice given in (1) of this 
section, it is to be emphasized for foreign learners that the general arti- 
culation of /t, d/ is an alveolar one, made with the tongue-tip raised. 
The corresponding phonemes of many other languages have a dental 
rather than an alveolar point of contact. Those learners who carry over 
from their own language a dental articulation should practise the 
slightly affricated forms of /t, d/, i.e. [t*], [d7], in words such as time, 
day. If the closure point remains dental, the affrication produced will 
be clearly of the [t°], [d°] type. Those learners who, in their own 
language, have two varieties of stop closure made with the tongue-tip, 
e.g. speakers of Indian languages having dental and post-alveolar or 
retroflex varieties, should avoid using their retroflexed plosives in 
English, since these sound over-retracted to the English ear; they should 
also avoid using their dental /t, d/ for English /@, 07. 
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8.2.6 Velar Plosives 


/k, g/ 


(1) Examples 


/k/—fortis (spelt k; c, cc + a, o, u; qu; ch, e.g. ‘kind, cake, accord, 





conquer, stomach, chemist, bouquet’; qu = /kw/ in “quiet, 
quart’, etc.; silent c or k in ‘muscle, knew, knit’, etc.) 

accented, aspirated—come, car, kin, incur, according; cry, clean, 
quick, queue 

accented after /s/, unaspirated—scum, scar, skin 

weakly accented, relatively unaspirated—income, baker, talking, 
biscuit, anchor; secret, duckling, equal, dockyard 

syllable final—leak, duck, rock, choke, bank, bulk, desk 

with no audible release—Blackpool, locked, blackboard, thick dust, 
black cat, dark grey, deckchair, lock-jaw 

followed by nasal consonant—acknowledge, dark night, thicken 
(sometimes /61kn/), black magic 

followed by lateral consonant—buckle, clean, close, blackleg 

/g/—lenis (regularly spelt, g, gg; sometimes gh, gu, e.g. “ghost, guard”; 
g silent in ‘gnaw, gnat, diaphragm, sign, reign’, etc.) 

initial, partially devoiced—go, geese, guess, girl; glass, grass, Gwen 

intervocalic, voiced—eager, hunger, figure, ago, begin, eagle; 
juggling, angry, anguish, argue 

final, voiceless—dog, leg, rogue, vague 

with no audible release—rugby, begged, bagpipes, wagtail, big 
game, eggcup, big jaw, big chin 

followed by nasal consonant—dogma, ignore, quagmire, big man, 
drag-net, organ-grinder 

followed by lateral consonant—bugle, struggle, glow, wriggling, 
dog lead 

Compare /k/, /g/—cap, gap; coat, goat; clue, glue; decree, degree; 
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Fic. 30.—/k, g/ + /11f. 
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FIG. 31.—/k, g/ + /D7. 
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bicker, bigger; stacker, stagger; lacked, lagged; ankle, angle; 
hackle, haggle; pick, pig; back, bag; duck, dug; crate, great 


(2) Description.—The soft palate being raised and the nasal resona- 
tor shut off, the primary obstacle to the air-stream is formed by a 
closure made between the back of the tongue and the soft palate. Lung 
air 1s compressed behind this closure, during which stage the vocal folds 
are wide apart for /k/, but may vibrate for all or part of the compres- 
sion stage for /g/ according to its situation in the utterance. The lip 
position will be conditioned by that of adjacent sounds, especially 
following vowels or semi-vowels, e.g. spread lips for the plosives in 
keen, geese, and somewhat rounded lips for the plosives in cool, goose, 
quick. The air escapes with force upon the sudden separation of the 
linguo-velar closure, unless the air-stream has been blocked by a second 
closure forward of the velum (as for /p/ or /t/), or has been diverted 
through the nose by the lowering of the soft palate (as for /n/); when a 
lateral sound follows, the air-stream will have a lateral escape round the 
point of alveolar closure. 

The velar stop contact is particularly sensitive to the nature of an 
adjacent vowel (especially a following vowel). Thus, when a front 
vowel follows, e.g. /1:/ in key, geese, the contact will be made on the 
most forward part of the soft palate and may even overlap on to the 
hard palate; when a back vowel follows, /D/ in cof, gone, the contact on 
the soft palate will be correspondingly retracted; a contact in the central 
region of the soft palate is made when a vowel of a central type follows, 
e.g. /A/ or /3:/ as in come, gun, girl (see Figs. 30, 31). 

(3) Variants.—The actual extent of advancement or retraction of the 
velar closure will depend upon the nature of the extreme vowels /1:/ 
and /b/ used by an individual; if /i:/ has a very front and tense articula- 
tion, the /k, g/ closures will in turn be near palatal, i.e. [c, J] (noted by 
John Wallis in the seventeenth century as ky, gy), whereas, if /D/ is of 
the most back and open variety, the velar closure will be of the most 
retracted kind. (For other variations affecting all plosives, see §§8.2.1, 
8.2.3.) Since the initial clusters /kl, gl/ as in clean, glean, are not in 
opposition with /tl, dl/ which do not occur initially, a substitution of 
/tl, dl/ for /kl, gl/ in such positions may occasionally be heard both in 
RP and in other forms of English. 

(4) Chief sources.—PresE /k, g/ derive from the same phonemes in 
OE (in single and geminated forms) and in French introductions. Some 
English forms with /k/ (spelt with c or ck) as in pocket, carpenter, 
derive not from the affricates usual in Central French (pochette, 
charpentier) but from the plosive equivalents of Northern French 
dialects. Again, some early French borrowings with qu retain the 
original /kw/ pronunciation, e.g. quit, squadron, where modern 
French has lost the /w/ element (cf. the later borrowing bouquet). 
Occasionally, a learned c spelling has been introduced with a resultant 
/k/ pronunciation, e.g. perfect, subject (note that in victuals, despite 
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the introduction of a c in the spelling, no /k/ is pronounced). The 
phoneme /g/ is sometimes spelt gu before e, i, in words of English 
origin, e.g. guest, guilt, by analogy with words of Central French origin 
such as guide, guerdon. /k, g/ initially before /n/ (know, gnaw) were 
finally lost late in the seventeenth century; /g/ following /n/ was lost 
in the south of England, e.g. in sing, rung, etc., in the seventeenth 
century, though it is kept intervocalically in certain words, e.g. finger, 
longer, single, and in all positions in many Midland and Northern 
dialects (see §8.6.3 for the origins of /p/). 

(5) Advice to foreign learners.—Note the general remarks in 
§§8.2.1, 8.2.3, and the examples for practice given in (1) of this section. 
French learners should be particularly careful not to over-palatalize 
/k, g/ both before and after front vowels; Spanish learners should 
avoid reducing intervocalic /g/ to a fricative [y] or pronouncing initial 
/g/ (especially before a back vowel) as [gw] or [w]. 


8.2.7 Glottal Plosive 
[2] 


(1) Description.—In the case of the glottal plosive (stop), the obstruc- 
tion to the air-stream is formed by the closure of the vocal folds, 
thereby interrupting the passage of air into the supra-glottal organs. 
The air pressure below the glottis is released by the sudden separation of 
the vocal folds. The compression stage of its articulation consists of 
silence, its presence being perceived auditorily by the sudden cessation 
of the preceding sound or by the sudden onset (often with an accom- 
panying strong breath effort) of the following sound. The plosive is 
voiceless and must be assigned to the fortis category both because of the 
strong air compression involved and also because, when substituted for 
or reinforcing a fortis plosive in final positions, e.g. in shape, feet, 
back, it has the same effect of reducing the length of the preceding 
sounds as a fortis consonant. The articulation of [?] must be distin- 
guished from that type of glottalization or laryngealization which 
involves tension in the laryngeal region and either an excessively slow 
rate of vibration of the vocal folds (‘creaky voice’) or a vibration of the 
false vocal folds situated just above the true vocal folds. In the pro- 
duction of these latter sounds, often heard in the lowest pitches of 
intonation and associated with weak intensity (though sometimes with 
muscular tension, e.g. at the lower level of the fall-rise tune) or on 
almost any pitch level in certain affected voice qualities, there is no total 
closure of the vocal folds. 

It is clear from the description given above that there is no acoustic 
manifestation of the glottal plosive other than the abrupt cessation or 
onset of the adjacent sounds. 
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(2) Usage.—The glottal plosive, though frequently used by RP 
speakers, is not a significant sound in the RP system. Moreover, a 
distinction must be made between its regular occurrence in RP, certain 
tendencies for its more extended use in RP, and special cases which are 
found exclusively in regional speech. 

(a) General RP usage.'—[?] serves regularly for many RP speakers 
as a syllable boundary marker, when the initial sound of the second 
syllable is a vowel. Thus, a hiatus of vowels belonging to different 
syllables (especially when the second vowel is accented), may in careful 
speech he separated by [?] instead of being joined by a vocalic glide, e.g. 
co-operate, geometry, reaction [kau ?pporeit, d31?bmotri, ri'?ækfn], 
and even when the second vowel is weakly accented, e.g. day after day 
[dei a: fta dei]. This usage of [?] is extended amongst careful speakers 
to those cases where there is a danger of an intrusive /r/ (see §11.3.7) at 
a point of vowel hiatus, e.g. in Shah of Iran, law and order, drama and 
music; the glottal marker is in turn applied by some speakers in cases 
where a regular linking /r/ is permissible, e.g. in /ater on, far off, four 
aces, but where some inhibition is imposed by the fact that the final 
vowel in the first word is /9, a:/ or /o:/, i.e. those associated with 
intrusive /r/ sounds. In these latter cases, and in those cases of true 
hiatus where the second vowel is weakly accented, the glottal action 
may be one of unusual stricture rather than of total closure. 

Finally, any initial accented vowel may be reinforced by a preceding 
glottal stop when particular emphasis is placed on the word, whatever 
the preceding sound, e.g. in /t’s [?] empty, I haven’t seen [?] anybody, 
She’s [?] awfully good; or again, any vowel, initial in an accented 
syllable, may receive this glottal reinforcement, e.g. It’s un [?] eatable, 
such dis|?]lorder. 

(b) Extended usage in RP.—(i) Reinforcement of final fortis 
plosives. —As was pointed out in §8.2.3(3), word final /p, t, k/ and also 
/tf/ may be reinforced by a glottal closure which may coincide with the 
mouth closure or slightly precede it, especially when the word is final in 
the utterance, or, medially in the utterance, when the following word 
begins with a consonant. The release of such a glottal closure takes 
place at the same time as, or slightly preceding, that of the mouth 
closure (if the mouth closure is audibly released), so that the final 
release is not normally ejective. This appears to be true whatever the 
vowel or continuant consonant preceding the fortis stop, as the follow- 
ing examples of possible reinforced stops illustrate: — 


! See also: P. Christophersen, ‘Glottal Stop in English,’ English Studies, xxxiii, 1952; 
J. D. O’Connor, ‘RP and the reinforcing Glottal Stop,’ English Studies, xxxiii, 1952; 
B. S. Andrésen, ‘The Glottal Stop in the Received Pronunciation of English,’ Univer- 
sitetet i Bergen, Arbok, 1958; Preglottalisation in English Standard Pronunciation, Oslo, 
1968. 
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Ae Dy uel? + /k/ + /tf/ 
io leap feet leak reach 
INE rip bit lick rich 
a sharp cart lark arch 
IN/ cup cut luck much 
/30/ rope boat joke coach 
nasal limp went bank branch 
/\/ help belt bulk mulch 
/s/ lisp list risk — 
/f/ — lift — — 


Note 


(1) The glottal closure is perceived most abruptly when it is a question 
of cutting off a preceding short vowel. 

(2) In the case of the stop being preceded by a fortis fricative (/f, 0, s, 
f/), the glottal reinforcement appears usually to be simultaneous with 
the oral closure rather than anticipating it. 

(3) Inthe case of word or morpheme final fortis clusters, e.g. /ps, pt, 
ts, ks, kt, tft/ in corpse, apt, writes, axe, act, reached, the glottal 
reinforcement is applied to the first element of the cluster and may 
continue into the second, if this is also a plosive. 


Such reinforcement is also sometimes heard in utterance-medial 
positions. This is the case when the word medial or final stop is made by 
/tf/ or one of the clusters mentioned above and is followed by a vowel, 
e.g. creature, reach it, watch it, drops it, dropped it, liked it, likes it, fits 
it. Itis very rare for reinforcement to be heard in association with word 
final /p, t, k/ + vowel. 

In those final positions where reinforced /p, t, k/ are not released, 
the type of plosive is identifiable by means of the preceding vowel 
transition; when [?] precedes the plosive, thereby cutting off much of 
the vowel transition, the plosive is frequently released. 

(11) Replacement of /p, t, k/ by [?].—Some RP speakers replace 
word or morpheme final /p, t, k/ by [?] when a consonant follows, no 
oral closure being made. Such a glottal closure often replaces /t/ when 
the following consonant is homorganic, i.e. /t/, /d/, /tf/, or /d3/ asin 
that table, get down, that chair, great joke, or /n/ as in witness, not 
now, or (less frequently) /1/ as in Scotland, at least; but [?] is also heard 
for /t/ before other plosives, e.g. in football, gatepost, cat-call, catgut, 
and somewhat less commonly before nasals, e.g. not time, nutnieg, 
fricatives, e.g. Catford, not for me, not very, what thing, out there, 
outset, great zeal, nutshell, and /r, w, j/, e.g. outright, cart-wheel, not 
yet. Indeed, pure [?] 1s to be heard for /t/ before all non-syllabic 
consonants except /h/. It is more common before a stressed syllable 
e.g. at last [97 la:st] is more common than Scotland ['sko?land] and 
outran [au? reen] more common than outright ['au?rait]. Before /h/, 
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[2] for /t/ is comparatively rare, e.g. not here, boathouse. In the case of 
following syllabic /1/, i.e. where /1/ has a vowel function, e.g. in little, 
the use of [?] for the preceding /t/ is not generally accepted in RP, 
although it is sometimes to be heard for /t/ before syllabic /n/ as in 
cotton, Eton. 

The replacement of final /p, k/ by [?] is much less frequent among 
RP speakers, except when the following consonant is homorganic, e.g. 
soappowder, cap-badge, back-garden, bookcase. The use of [?] for 
/p, k/ in such cases as cup-full, reptile, capture, upset, lecture, back- 
door, breakfast, is not generally acceptable as RP. 

RP speakers may occasionally be heard to use [?] for the [t] element 
of final /tf/, e.g. in coach, much, catch, couch. 

(c) Usage of regional speech.—The replacement of /t, k/, and more 
rarely of /p/, in final positions by [?], without any mouth closure, is 
typical of several kinds of regional speech, e.g. popular London, as in 
Mind your feet [mdi doze 'fə1?], Have a look ['zv 9 lu?], get up [,ged 
‘A?]. Such replacements result in the neutralization of oppositions 
otherwise signalled by vowel transitions or release bursts, e.g. between 
type, tight, tike, the loss of this information being made sood by other 
cues supplied by the context. 

Very commonly, in this kind of regional speech, an unaccented /t/ is 
replaced by [?] between vowel or /n, 1/ and vowel, and between vowel 
and syllabic /n, YY, e.g. in daughter, butter, Saturday, Waterloo, 
writing, potato [po tæ1?9], salty, wanted, mutton, little, or within the 

hrase as in not in [no? 'in], but what he ought to do [bə?'wo? i: 3:? ə 
dau]. In rapid speech, however, especially when a number of articula- 
tions come in quick succession, the glottal closure is likely to be very 
weak. 

Glottal replacements of /p, k/ also occur in similar situations, e.g. in 
supper, paper, cup of tea [ ka? a 'ts91], lucky, joker, he doesn’t like it 
[91 'd36?'la1? 17], but there appears to be a greater tendency to retain a 
bilabial or velar closure. In cases of /-mpl, -ntl, -9kl/, as in simple, 
mental, uncle, if the nasal consonant is articulated, the [?] used for 
/p, t, k/ is likely to be accompanied by the already formed bilabial, 
alveolar, or velar closure; if, however, as often happens in London 
speech, /-1m, -en, -An/ are realized as [i, €, A], the following stop may be 
only glottal. 

In London speech, [?] also replaces the fricative /f/, especially in the 
phrase half a, e.g. half a minute [ad a ‘mini?]. It should also be noted 
that, initial /h/ often being elided, vowels thus rendered initial may 
have glottal emphatic reinforcement applied to them, especially in 
hiatus with a preceding vowel, e.g. I hate him [ai'?ze1? im], we haven’t 
[wi ?zevn?]. 

(3) History.—Since it would appear that [?] has never been a signifi- 
cant sound in English, it is not to be expected that its stylistic use should 
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have been described in detail by the early grammarians. It is, however, 
mentioned in the seventeenth century! as a feature of the onset of initial 
vowels and, in works dealing with singing technique, has traditionally 
been described as the ‘hard attack’. But the substitution of [?] for a 
fortis plosive in regional speech is not explicitly mentioned until the 
nineteenth century and it is only in recent years that the phenomenon of 
reinforcement has been explicitly noted. Lack of descriptive evidence 
concerning this non-significant sound 1s not, however, a reason for 
assuming that [?] is a feature of recent occurrence in English speech; we 
have little information concerning intonation patterns before the 
second half of the eighteenth century, but it is unlikely that the language 
has not made use of pitch distinctions in the utterance from the earliest 
times. 


8.3 Affricates 


l. Definition.—The term ‘affricate’? denotes a concept which is 
primarily of phonetic importance. Any plosive whose release stage is 
performed in such a way that considerable friction occurs approxi- 
mately at the point where the plosive stop is made, may be called 
‘affricative’. It should be noted that the plosive and homorganic 
fricative will have the same voicing. The friction present in an affricate 
is of shorter duration than that which characterizes the fricatives 
proper. In English, apart from the exceptional affrication mentioned in 
$8.2.3(6), only /t, d/ may have this type of release, namely in /tf, d3, tr, 
dr, ts, dz, t8, dð/. 

2. Phonemic status.—From a functional or distributional point of 
view these compound sounds may be considered either as single 
phonemic entities or as sequences of two phonemes. As always, the 
choice of phonemic solution will depend upon the purpose of the 
analysis, but the following factors may be taken into account:— 

(1) The distribution of the sound complex, especially in the follow- 
ing situations:— 


(a) syllable initial 

(b) syllable final, within the same syllable or morpheme 

(c) word medial, with close-knit stop and fricative elements 

(d) word medial or final, with stop final and fricative initial in 
adjacent syllables or morphemes. 


A sound complex which has a general distribution and shows an 
opposition between close-knit medial realizations and disjunct (i.e. 


'See D. Abercrombie, ‘Forgotten Phoneticians’, Transactions of ihe Philological 
Society, 1948. 
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with the elements in separate syllables or morphemes) realizations may 
be treated as a complex phonemic entity. 


(a) (b) (c) (d) 
/tf/ chap patch butcher lightship 
/d3/ jam badge aged — 
r tram — mattress footrest 
/dr/ dram — tawdry handrail 
/ts/ ~~ quartz curtsey(?) cats, outset 
/dz/ — adze Pudsey(?) roads 
/t8/ = — = eighth 
/d0/ — — — (width) 


The table above shows that /tf, d3/ best fulfil these conditions, 
behaving in an intervocalic position very much as a simple consonant, 
1.e. without separation of the elements between the syllables (cf. 
pitches, pities; aged, aided). /tr, dr/ lack occurrences in final positions, 
but have a high degree of close-knit distinctiveness as a complex. /ts, 
dz/ do not occur initially! (except in rare foreign words) and only 
doubtfully in close-knit medial situations. /t8, dd/ have an occurrence 
restricted to the final position in very few words. Thus, /tf, d3/, and to 
a lesser extent /tr, dr/, may be treated as single, complex, phonemes, 
such an interpretation being particularly valuable in the teaching of RP 
to most nationalities. 

If /tf, d3, tr, dr/ are treated as sequences of two phonemes, the 
realization of the second element will differ according to whether it 
occurs in the same syllable or morphemeas the stop (1.e. shorter friction 
in the case of /tf, d3/ and fricative /r/ in the case of /tr, dr/) or whether 
a syllable or morpheme boundary occurs between the elements (i.e. 
longer friction in the case of /t/ + /J/ anda less fricative /r/ in the case 
of /t, d/ + /r/—no boundary being possible between /d/ and /3/, 
since /3/ does not occur in syllable or morpheme initial position). 

(2) Possibilities of commutation of the elements.—(a) The elements 
of /tf/ may be commutated within the same syllable: word initially, in 
the stop, with zero only, cf. chip, ship; in the fricative, with /r, w/ and 
zero, cf. chip, trip, tip, chin, twin”; word medially, in the stop, with 
zero only (a syllable boundary being possible between /I/ and /f/ in 
such a case as welshing, and the medial sequence /-nf/ being an alter- 
native form of /-ntf/ as in pinching), cf. matches, mashes; in the 


1 The different length of friction as between the fricative element of an affricate and a 
fricative following a plosive is shown by a comparison of the affricated /t/ in cat [keet*] 
and the longer friction of the plural form cats /keets/. Dialectal affrication of /t, d/ is, 
however, more common initially (where /t, d/ + /s, z/ is unusual) than finally, being 
inhibited in final positions by the risk of confusion with the inflected forms. 

2 There is a tendency for initial /tj/ to become /tf/ as it does medially; oppositions 
between initial /tj/ and /tf/ are rare, but cf. the first syllable of Tuesday with choose. 
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fricative, with /r/ or zero (a syllable boundary occurring between /t/ 
and /w/, /1/, /n/, etc., as in outward, atlas, chutney), cf. enchants, 
entrance (v.), marcher, martyr; word finally, in the stop, with /l/ or 
zero (final /-nf/ as in French being an alternative for /-ntf/), cf. watch, 
Walsh, wash, and in the fricative, with /s/ or zero, cf. catch, cats, cat. 

The elements of /d3/ have a more restricted possibility of commuta- 
tion owing to the rarity of syllable initial /3/. Thus, the elements com- 
mutate within the same syllable: word initially, in the fricative only, 
with /r, }, w/ and zero, cf. jest, dressed, June, dune (witha tendency for 
/dj/ to become /d3/), dwell, jell, jam, dam; medially, in the stop, with 
zero, cf. ledger, leisure, and in the fricative, with /r/ and zero in the 
same syllable, cf. orgy, Audrey, larger, larder; word finally, in the 
fricative only, with /z/ or zero, cf. hedge, heads, head (final /3/ being a 
variant of /d3/). 

Thus the possibilities of commutation are restricted in the case of the 
elements of /tf/ to zero (and occasionally /1/) for the stop, and to zero, 
/r, w/ and /s/, according to the situation, for the fricative. The com- 
mutability of the elements of /d3/ 1s also restricted, 1.e. with zero in the 
case of the stop (medially), and with zero, /r, j, w/ or /z/, according to 
the situation, in the fricative. Moreover, /tf/ isin opposition to /d3/ as 
a complex in all positions (see §8.3.1 for examples). 

(b) /tr, dr/, on the other hand, have considerable possibilities of 
commutation especially in the first element: in the case of /tr/, cf. try, 
cry, pry, fry, rye; true, shrew, drew, grew, threw, brew; tree, three, tea; 
trill, chill, twill; Troon, tune; train, chain; in the case of /dr/, cf. drew, 
true, crew, grew, brew, threw, shrew, rue, do, due, Jew; dry, fry, pry, 
etc. 

On the basis of commutability, therefore, /tr, dr/ are more reason- 
ably to be considered as consisting of separable elements than /tf, d3/. 

(3) Native speakers’ reaction.—It seems that the native speaker does 
not regard /tf, d3/ as composite sounds, 1.e. composed of distinctive 
elements. He is likely, for instance, to consider that chip, catch, consist 
of three parts in the same way as tip, ship, or cat, cash; or again, jam, 
badge, as structures equivalent to dam, bad. (It is, of course, true that 
PresE /tf, d3/ derive in many cases from earlier (OE or OF) plosives [c] 
or [y], as well as from a coalescence of /t/ or /d/ + /j/—see §8.3.1— 
but this is irrelevant in any consideration of the present structure of the 
language.) On the other hand, /tr, dr/ are not normally regarded as 
anything but sequences of /t, d/ + /r/ and, in many dialects where the 
/r/ has a tap or trill realization, there is no question of affrication. 

(4) Conclusion.— In view of the considerations stated above, /tf, 
d3/ will here be treated as complex phonetic but single phonemic 
entities. /tr/ and /dr/ will also be considered as units, but mainly on 
account of the phonetic relationship of the elements, 1.e. the retracted 
nature of the [t] and the friction associated with the [1] in general RP, 
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and because, in teaching non-English learners, the treatment of RP 
/tr, dr/ as single articulations is an effective pedagogic procedure. 

3. Acoustic features.—The acoustic features of affricates are those 
appropriate to stops (see §8.2.2) and fricatives (see §8.4). Since, how- 
ever, the release stage is fricative, the most essential perceptual cues will 
be provided by the transition between the preceding vowel and the stop 
and by the explosive onset of the friction. Nevertheless, in the case of 
/tf, d3, tr, dr/, the transition will not be that which is typical of the 
alveolar plosives, since the stops of /tf, d3/ will be of a palatalized type 
and those of /tr, dr/ of a post-alveolar kind; there may be also, in the 
case of /tf, d3/, a brief intervening friction of the alveolar /s, z/ type 
before the [f, 3] elements proper.' 


8.3.1 Palato-alveolar Affricates 
/tf, d3/ 
(1) Examples 


/tf/—fortis (spelt ch, tch, t + ure, eous, and t + ion when fis preceded 
by s, e.g. “chain, watch, nature, righteous, question?) 
word initial—cheese, chain, charge, charm, choke, cheer 
word medial (intervocalic)—feature, richer, wretched, orchard, 
butcher, nature, merchant 
(consonant preceding)—gesture, posture, mischief, juncture, 
capture, lecture, pilchard, culture, adventure 
word final—wretch, catch, larch, porch, much, coach 
(consonant preceding)—inch, conch, bench, branch, filch, 
mulch 
/d3/—lenis (spelt j, g, dg, sometimes gg, dj, de, di, ch, e.g. ‘jam, gem, 
midget, suggest, adjacent, grandeur, soldier, Norwich’) | 
word initial—gin, jest, jar, jaunt, Jew, jerk, joke, joist, jeer 
word medial (intervocalic)—midget, ledger, margin, fragile, urgent, 
orgy, adjacent, agenda, major 
(consonant preceding)—avenger, danger, stringent, soldier, 
Belgian, bulges, object 
word final—ridge, edge, large, dodge, judge, huge, age, doge, gouge 
(consonant preceding)—bilge, bulge, hinge, sponge, change 
Compare /tf/, /d3/—chin, gin; chest, jest; choose, Jews; choke, joke; 
cheer, jeer; catches, cadges; nature, major; a venture, avenger; 
riches, ridges; leech, liege; larch, large; perch, purge; lunch, lunge; 
cinch, singe; beseech, besiege 


1See P. Strevens, ‘Spectra of Fricative Noise in Human Speech,’ Language and 
Speech, vol. 3, 1, 1960. 
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FIG. 32.—Stop phase of /tf, d3/. 


(2) Description.—The soft palate being raised and the nasal reso- 
nator shut off, the obstacle to the air-stream is formed by a closure 
made between the tip, blade, and rims of the tongue and the upper 
alveolar ridge and side teeth. At the same time, the front of the tongue is 
raised towards the hard palate in readiness for the fricative release. The 
closure is released slowly, the air escaping in a diffuse manner over the 
whole of the central surface of the tongue with friction occurring 
between the blade/front region of the tongue and the alveolar/front 
palatal section of the roof of the mouth. During both stop and fricative 
stages, the vocal folds are wide apart for /tf/, but may be vibrating for 
all or part of /d3/ according to the situation in the utterance. (/d3/ 
shares the features of devoicing in initial and final positions exhibited 
by plosives, see §8.2.1(4), and fricatives, see §8.4(3).) /tf, d3/ differ 
from plosives in that they never lose their fricative release stage. The lip 
position will be conditioned by that of adjacent sounds, especially that 
of a following vowel (cf. the greater lip-rounding of /tf/ in choose in 
relation to that of cheese), though with some speakers a certain amount 
of lip-protrusion is always present. 

In addition, it should be noted that the fortis /tf/, when final in a 
syllable, occasions pre-fortis clipping, having the same effect of 
reducing the length of preceding sounds as was noted for /p, t, k/ (see 
§8.2.1(5)); comparatively full length of preceding sounds is retained 
before /d3/. This effect must be taken as a primary distinctive feature 
of the /tf/-/d3/ opposition in final positions. 

Since there is no opposition between /tf, d3/ and /tr, dr/ in syllable 
final positions, the palatalization of /tf, d3/ is not significant in such 
situations; the articulation of post-alveolar affricates in such cases as 
touch, large, will generally be perceived as /tf, d3/. 

(3) Variants. —No important variants of /tf, d3/ occur, except in the 
matter of the degree of lip-rounding used. Some very careful speakers, 
however, use /t/ and /d/ + /j/ in words which frequently have /tf/ or 
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/d3/ e.g. gesture, culture, virtue, statue, righteous, fortune, literature, 
question, posture, Christian, soldier, grandeur. Little confusion arises 
from the retention of /tj/ and /dj/ forms, since oppositions between 
medial /tf, d3/ and /tj, dj/ are rare, e.g. a potential case is verdure with 
/dj/ and verger with /d3/. In the case of /t, d/ + u, both the palato- 
alveolar affricate and /t/ or /d/ + /j/ may be heard, e.g. in actual, 
punctual, mutual, obituary, individual, gradual, educate. Some 
speakers omit the stop element in the clusters /ntf, nd3/ in word final 
positions as in pinch, French, lunch, branch, paunch, hinge, revenge, 
challenge, strange, scrounge, etc., and also medially as in pinching, 
luncheon, avenger, danger, etc. 

(4) Chief sources.—PresE /tf, d3/ derive from: early OE [c] and [J] 
(child, chin, kitchen, teach, church, edge, bridge), this change being 
accomplished by the late OE period; OF [tf, d3] (chief, chair, chamber, 
choice, merchant, branch, judge, major, age, village, change); acoales- 
cence of medial /tj/ or /dj/ (nature, virtue, question, creature, 
grandeur, soldier)—this latter change was not general until the early 
eighteenth century; a number of eighteenth-century coalesced forms 
have now reverted to a stop + /j/ or /1/ sequence, e.g. piteous, bestial, 
tedious, odious. 

(5) Advice to foreign learners.—Some learners (especially Scandina- 
vians) are apt to articulate /tf, d3/ with too much lip-spreading and 
over-palatalization, producing sounds resembling /tc, dj]. Particular 
attention should also be paid to the shortening of sounds preceding 
syllable final /tf/, the examples in (1) of this section providing practice 
for this feature. Moreover, sequences of affricates should be practised, 
care being taken to pronounce the fricative elements of both affricates, 
e.g. which chair; Dutch cheese; large jar; Judge Jeffreys. 


8.3.2 Post-alveolar Affricates 
/tr, dr/ 
(1) Examples 


/tr/— fortis (spelt tr; note ‘naturally’ often /'nætrəli/, and /trı/ < 
Ari, e.g. ‘history, territory’, etc.) 
word initial—tree, trick, trend, trod, true, truck, tripe 
word medial (initial in syllable)—attract, poetry, petrol, sultry, 
mattress, poultry, pastry, country, entrance (n.) /entrans/, 
entrance (v.) /in'tra:ns/ 
/dr/—lenis (spelt dr; note /dri/ < /dari/, e.g. ‘boundary’, etc.) 
word initial—dream, drip, dram, drop, drawl, drum, draft, drove, 
drought, dreary, Drury 
word medial (initial in syllable)—address, adroit, hydrangea, 
tawdry, sundry, hindrance, Andrew 
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FIG. 33.—Stop phase of /tr, dr/. 


Compare /tr/, /dr/—trip, drip; trench, drench; tram, dram; trunk, 
drunk; troop, droop; try, dry 
/tr/, /tf/—trees, cheese; trip, chip; trap, chap; true, chew; 
train, chain 
/dr/, /d3/—drill, gill; dressed, jest; draw, jaw; drew, Jew; 
dram, jam; drear, jeer; Drury, jury 


(2) Description.—In the pronunciation of RP /tr, dr/, the soft 
palate being raised and the nasal passage shut off, the obstacle to the 
airstream is formed by a closure made between the tip and rims of the 
tongue and the rear edge of the upper alveolar ridge and the upper side 
teeth. The centre of the tongue is hollowed in readiness for the [1] type 
friction, which results from the slow release of the stop (see §8.8 for the 
articulation of /r/). During the stop and fricative stages, the vocal folds 
are wide apart for /tr/; in the case of /dr/, voice is present throughout 
the affricate when medial, but may be associated only with the fricative 
element when initial. /tr, dr/ do not occur in syllable final position. The 
lip position is conditioned mainly by that of the following vowel, 
though some speakers will always use a certain amount of lip protrusion 
typical for /r/. A feature of the close-knit complexes /tr, dr/ is the 
fricative nature (voiced or voiceless) of the [1] element, which in this 
respect differs from the usual realization of RP /r/. 

(3) Variants. —In those regional types of English where the normal 
/r/ is realized as a lingual or uvular tap or trill, the post-alveolar type of 
stop + fricative complex described above may not be used, but rather a 
clear sequence of stop + trill or tap (see §8.8 for varieties of /r/). 

(4) Chief sources. —Except in the case of elision of intermediate /3/ 
(see $9.6), /tr, dr/ derive from similar earlier complexes. 

(5) Advice to foreign learners. —For those foreign learners whose 
ownr sound is either a lingual trill or a uvular trill or fricative, it is often 
helpful to approach the English RP /r/ through the affricate complexes 
/tr, dr/, thus establishing the correct place of articulation. Learners 
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should start from /tf, d3/ and retract and hollow the tongue until 
suitable /tr, dr/ affricates are achieved; the comparative examples 
given in (1) of this section will serve as practice material for this 
exercise. 


8.4 Fricatives 


In the articulation of a fricative consonant, two organs are brought 
and held sufficiently close together for the escaping air-stream to pro- 
duce local air turbulence; fricatives are, therefore, like plosives and 
affricates, characterized by a noise component. This turbulence may or 
may not be accompanied by voice. There is an on- or off-glide in respect 
of an adjacent sound, most appreciable if the adjacent sound is a vowel. 

The RP fricative phonemes comprise four pairs /f, v; 8,0; s, z; f,3/ 
and /h/. [x], a voiceless (fortis) velar fricative, occurs exceptionally in 
some speakers’ pronunciation of Scottish words such as loch; for [0 , f3, 
S, Z, X, Y] occurring as the fricative element of affricated plosives, see 
$8.2.3(6), and for the fricative allophones of /r, j, w/, see $$8.8-8.9.2. 

The following words illustrate oppositions, especially between 
members of the fricative pairs, in word initial, medial, and final 
positions: — 


Initial. Medial. Final. 
/f/ feel proofing leaf 
/v/ veal proving leave 
/0/ thigh earthy, ether wreath 
/0/ thy worthy, either! wreathe 
/s/ seal racer peace 
ae zeal razor peas 
/{/ sheet fission, Confucian ruche 
ae gigolo vision, confusion rouge 
/h/ heat behave — 


These oppositions may be realized by means of one or several of the 
following phonetic features: — 

(1) Place of articulation.—/f, v/—labio-dental; /6, 6/—dental; 
/s, z/—alveolar; /f, 3/—palato-alveolar; /h/—glottal. Such a series 
must be considered to be relatively complex. The existence, in partic- 
ular, of place-oppositions between the dental, alveolar, and palato- 
alveolar areas of articulation necessitates a precision of articulation in 
English which is not required in languages lacking, for instance, either 
the dental or the palato-alveolar pairs. Thus, the lack of palato-alveolar 
fricative phonemes in Spanish permits the retraction of articulations in 
the alveolar region, e.g. /S/; whereas the absence in French and many 


l If pronounced as /i:09/—rarer than / a109/. 
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other languages of dental fricative phonemes allows a dentalized 
quality in the alveolar articulations, which is liable in English to cause 
confusion with /8, 0/ or to produce a ‘lisping’ fricative which is con- 
sidered socially undesirable. 

(2) Force of articulation.—Within the four pairs, /f, 6, s, f/ tend to 
be pronounced with relatively more muscular energy and stronger 
breath force than /v, 0, z, 3/; the former are fortis, the latter /enis.' /h/ 
is normally fortis in character, but may have a lenis allophone (see 
§8.4.6). 

(3) Voicing. —Like the lenis plosives and affricates, /v, 0, z, 3/ tend 
to be fully voiced only when they occur between voiced sounds, e.g. in 
cover, other, easy, leisure, a van, all that, by the zoo. In initial and 
(especially) in final positions, the lenis fricatives may be only partially 
voiced or completely voiceless, e.g. initially in van, that, zoo (i.e. with 
silence preceding) only the latter part of the friction is likely to be 
voiced, and finally in leave, breathe, peas, rouge /ru:3/ (i.e. with 
silence following) the friction is typically voiceless, though the con- 
sonant remains lenis—[y, 0, Zz, 3]. The fortis series is voiceless in all 
positions. /h/, however, Occurring only in word initial and medial 
situations, though voiceless in an initial position, may have some 
voicing medially between voiced sounds, e.g. anyhow. 

(4) Length of preceding sounds.—Whien /f, 8, s, f, v, Ò, Z, 3/ occur 
finally, their value is determined largely (since the voicing factor is not 
strongly operative) by the length of the syllables which they close.? Like 
all fortis consonants, /f, 8, s, J/ have the effect of reducing the length 
of the preceding vowel (particularly a long vowel or diphthong) and of 
/1,m, n/ interposed between the vowel and the fricative, cf. fife, loath, 
place, leash, self, pence, with five, loathe, plays, (liege), selves, pens, 
etc. The same pre-fortis clipping is operative when the fortis fricatives 
occur in a medial position, although here a contrastive voicing feature is 
likely to be present in /v, 0, z, 3/, cf. proofing, proving; earthy, worthy; 
racer, razor; fission, vision. 

Surmmmary.—The RP fricatives may, therefore, be said to be dis- 
tinguished: — 

(a) by means of a five-term Series in respect of place of articulation— 
labio-dental v. dental v. alveolar v. palato-alveolar v. glottal, and 

(b) at each of the first four points of articulation, by an opposition 
between fortis and lenis in all contextual situations; by the possibility of 
some voicing in the lenis members in initial positions; by an opposition 
of presence and absence of voice in medial positions; by an opposition 
of the length of preceding sounds. 


l But see §8.5 concerning ‘fortis’ and ‘lenis’ as phonological categories. 
?P. Denes, ‘Effect of Duration on the Perception of Voicing,’ Journal of the 
Acoustical Society of America, 27, 261 (1955). 
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Foreign learners, quite apart from using the correct mouth articula- 
tions, should pay particular attention to the degree of voicing in the 
lenis series and to the influence which the fortis series has on the length 
of preceding sounds. 


8.4.1 Acoustic Features of English Fricatives 


In acoustic terms,’ our perception of the various types of fricative 
(whose characteristic feature is a continuous noise component) appears 
to depend upon the following factors: — 

(1) Extent and position of noise component. —Continuous noise in 
the spectrum is appropriate to articulatory friction regions: — 


alveolars— a noise range from about 3,600—8,000 cps 
palato-alveolars— di o le 2,000—7,000 cps 
labio-dentals— Él úl a 1,500—7,000 cps 
dentals— És iS f 1,400 — 8,000 cps 
glottal— a ds j 500 —6,500 cps 


(2) Intensity of noise component.—/s, J/ have relatively high inten- 
sity; /f, 0, h/ relatively low intensity. The lenis series has an overall 
lower intensity than that of the fortis series. 

(3) Low frequency component.—The lenis series may have a 
periodic low frequency component (voicing) which is absent in the 
fortis series (see §3.1.4 and P. Denes, op. cit.) 

(4) Formant transitions. —Especially in the case of the low intensity 
labio-dentals and dentals, most information comes from the nature of 
the adjacent vocalic glide. In the case of /h/ (often an anticipatory 
voiceless version of the following vowel), the spectral pattern is likely to 
mirror the formant structure of the following vowel. 

(5) Duration of fricative noise.—The friction of the lenis series is 
shorter than that of the fortis series. 


8.4.2 Labio-dental Fricatives 
(1) Examples ° 


/f/— fortis (spelt f, ff, ph, gh, e.g. ‘fork, off, physics, enough’) 
word initial—feet, fit, fat, father, fool, fail, photo 


l See K. S. Harris, ‘Cues for the Identification of the Fricatives of American English’, 
J.A.S.A., 26, 952, 1954, also ‘Cues for the Discrimination of American English Frica- 
tives in Spoken Syllables’, Language and Speech, vol. 1, 1, 1958; D. B. Fry and P. Denes, 
‘The Solution of some Fundamental Problems in Mechanical Speech Recognition,’ 
L. and S., vol. 1, 1, 1958; P. Strevens, ‘Spectra of Fricative Noise in Human Speech,’ 
L. and S., vol. 3, 1, 1960; G. W. Hughes and M. Halle, ‘Spectral Properties of Fricative 
Consonants,’ J/.A.S.A., 28, 303, 1956. 
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word medial—affair, defend, offer, tougher, loafer, suffer, selfish, 
comfort 
word final—leaf, laugh, cough, stuff, roof, loaf, strife 
in word initial clusters—fry, fly, few, sphere 
in word final clusters—/f0(s)/ fifth(s), /ft(s)/ raft(s), /mf(s)/ 
triumph(s),! /lf(s)/ wolf(s), /1ft/ engulfed, /If0(s)/ twelfth(s), 
/fn(z, d)/ soften (s, ed), /fl(z, d)/ baffle(s, ed), /fs/ coughs. 
/v/—lenis (spelt, v, f, ph, e.g. ‘vine, of, nephew’) 
word initial—veal, vex, vat, vast, vain, vice, voice 
word medial—ever, nephew, over, silver, cover, event, canvas 
word final—leave, give, have, of, move, dove, grove 
in word initial clusters—/vj/ view 
in word final clusters—/vz/ loaves, /vd/ loved, /vn(z)/ oven(s), 
/\v(z, d)/ solve(s, d), /vl(z, d)/ grovel(s, ed), etc. 
Compare /f/, /v/—fine, vine; fat, vat; few, view; offer, hover; sur- 
face, service; laugher, larva; camphor, canvas; leaf, leave; proof, 
prove; safes, saves 


(2) Description.—The soft palate being raised and the nasal reso- 
nator shut off, the inner surface of the lower lip makes a light contact 
with the edge of the upper teeth, so that the escaping air produces 
friction. The actual point of contact will vary somewhat according to 
the adjacent sound, e.g. in the case of a back strongly rounded vowel or 
of a bilabial plosive (fool, roof, obvious), the contact on the lower lip 
tends to be more retracted than in the case of a front spread vowel ( feel, 
leaf). For /f/, the friction is voiceless, whereas there may be some vocal 
fold vibration accompanying /v/, according to its situation (see 
§8.4(3)). The tongue position of an adjacent vowel will persist or be 
anticipated during the labio-dental friction; in the case of intervocalic 
/f, v/, the tongue will articulate independently for the vowels or, if the 
vowels are similar, e.g. in stiffest, giving, will retain its position during 
the labio-dental friction. 

(3) Variants.—No important articulatory variants for /f, v/ occur 
among RP speakers, although word final /v/ may assimilate to /f/ 
before a fortis consonant initial in the following word, e.g. regularly in 
have to and more rarely in such sequences as /ove to, have some (see 
§11.3.3) or may, in familiar speech, be elided in the case of the 
unaccented form of of, have,e.g.inalot of money, [ could have bought 
it /a loto mani, al kad a 'bo:t 1t/, where /9/ is phonetically equivalent 
to the unaccented form of are, a (see 811.1).? In West Country speech, 
the fortis /f/ is often replaced by a weaker articulation approaching 
that of /v/. 


l An epenthetic /p/ may be inserted, thus /tralampfs/ see §8.4.4(3). 
2Note the accepted forms of man-of-war, tug-of-war, o'clock, will-o’-the-wisp, with 
/3/ rather than /av/ for of. 
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FIG. 34.—/f, v/. 


(4) Chief sources. —PresE /f/ derives from OE initial and final [f] 
(foot, fowl, free, thief, leaf, wife) and from OF [f] (fine, fruit, profit, 
chief). The spelling ph next to th (as in diphthong) was pronounced 
with /p/ rather than /f/ in eModE, and often spelt with p; such a pro- 
nunciation with a bilabial plosive is still widely heard. An earlier final 
[x] following a rounded vowel gave [f] by late ME (dwarf, laugh, rough, 
cough); in words such as plough, dough, bough, the PresE pronuncia- 
tion derives from the inflected form in which [x] has been lost, as also in 
the final [xt] sequence (ought, thought). An irregular change [u] > [v] 
> [f] has taken place in the word lieutenant, though the regular 
development /l(j)u:'tenant/ is used in the Navy and in American 
English. The increasing use of /f/ in nephew is a spelling pronuncia- 
tion, instead of the earlier, regular /v/. 

PresE /v/ derives from OE /f/ = [v] between voiced sounds (love, 
devil, wolves, wives, driven), from SW dialect initial OE [v] for [f] (vat, 
vixen, vane) and from OF [v] (vain, very, cover, serve). 

(S) Advice to foreign learners.—Some learners (particularly 
Indians) use too weak a contact for /v/, so that the friction is lost, 
giving the labio-dental frictionless continuant [v]; others (particularly 
Germans and Hungarians) use bilabial friction [f3] instead of the labio- 
dental sound. In both of these cases, there is a tendency to use the same 
sound for both /v/ and /w/. Care should, therefore, be taken to distin- 
guish such pairs as vain, wane; verse, worse; vest, west, etc., using 
strong friction between the lower lip and upper teeth for /v/. In addi- 
tion, attention should be paid to the degree of voicing in /v/ according 
to its situation and to the length of sounds preceding /f/ (see §8.4(3), 
(4), and examples in (1) of this section). 
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8.4.3 Dental Fricatives 
70.07 


(1) Examples 


/8/—fortis (spelt always th) 
word initial—thief, thick, thatch, thong, thought, thumb 
word medial—ether, ethics, lethal, method, author, anthem, lengthy, 
atheist, athletic, deathly, worthless 
word final—heath, smith, breath, path, cloth, earth, fourth, oath 
in word initial clusters—three, throw, thew, thwart 
in word final clusters—/®t/ earthed, /8s/ mouth’s, /p8(s)/ depth(s), 
/t0(s)/ eighth(s), /f0(s)/ fifth(s), /ksO(s)/ sixth(s), /m8/ warmth, ! 
/n8(s) month(s), /If@(s)/ twelfth(s), /nk8(s)/ length(s), /10(s)/ 
health(’s), /01(z)/ Ethel(’s), /@n/ earthen 
/0/—lenis (spelt always th) 
word initial—there, this, then, the, though, thy, they 
word medial —breathing, leather, gather, father, mother, northerly, 
southern, worthy, either, although 
word final—seethe, with, soothe, lathe, clothe, writhe, mouth (v.) 
in word final? clusters—/óm(z)/ rhythm(s), /On(z)/ southern(’s), 
/ðl(z)/ betrothal(s), /dz/ clothes, /dd/ writhed, /d0/ width (if 
pronounced /widó/) 
Compare /8/, /0/—thigh, thy; ether, breather; earthy, worthy; 
wreath, wreathe; mouth, mouth (v.); oath, clothe; truth’s, truths 
/8/, /s/—thick, sick; thought, sort; thumb, sum; mouth, 
mouse; worth, worse 
/0/, /t7—thick, tick; thought, taught; three, tree; heath, heat; 
both, boat; fourth, fort 
/0/, /z/—seethe, seas; lathe, laze; clothe, close (v.); breathe, 
breeze 
/0/, /d/—then, den; though, dough; there, dare; other, udder; 
worthy, wordy; seethe, seed; writhe, ride 


(2) Description.—The soft palate being raised and the nasal resona- 
tor shut off, the tip and rims of the tongue make a light contact with the 
edge and inner surface of the upper incisors and a firmer contact with 
the upper side teeth, so that the air escaping between the forward 
surface of the tongue and the incisors causes friction. With some 
speakers, the tongue-tip may protrude between the teeth. For /6/ the 
friction is voiceless, whereas for /0/ there may be some vocal fold 
vibration according to its situation (see §8.4(3)). The lip position will 


| An epenthetic /p/ may be inserted, thus /w>:mp0/, See $8.4.4(3). 
2/9/ does not oecur in a word initial cluster. 
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FIG. 35.—-/0, 0/. 


depend upon the adjacent vowel, e.g. being spread for thief, heath, 
these, etc., and somewhat rounded for thought, truth, soothe, etc. 

(3) Variants.—No important RP variants of /8, 0/ occur, but since 
/8, O/ offer difficulties of articulation when followed by /s, z/, they are 
sometimes elided, e.g. clothes /klauz/, months /mans/, or /mants/. In 
sequences of the type /s, z/ followed by unaccented /0/ as in Is there 
any?, What’s the time?, the preceding alveolar articulation may 
influence the dental fricative in rapid speech—/'1z zar ‘enl, wots za 
'taim/. Again, in popular London speech, the difficulties of the dental 
articulation may lead to their replacement by labio-dental fricatives, 
e.g. throw it, Smith /'frau it, smif/, mother, breathe in /‘mava, 'bri: v 
'm/. In this kind of speech, too, other alveolar articulations may be 
heard for the weak /0/, e.g. all the way /'9:1 da 'we1/, in the morning 
/in na 'mornin/. 

(4) Chief sources.—PresE /0/ derives from initial and final 
(including morpheme final) OE [8] (think, throat, bath, tooth, earthly, 
worthless), from OF [0] (faith), from learned Greek forms (theory, 
thesis) and others which until ModE commonly had /t/ (catholic, 
authority, theatre). 

PresE /0/ derives from OE /0/ = [0] between voiced sounds (other, 
feather, breathes). Note that of the English words where there is initial 
/0/ rather than /8/ (which is to be expected from the OE forms), e.g. 
the, this, then, that, than, they, etc., most are words which are 
frequently medial and unaccented in an utterance. 

(5) Advice to foreign learners.—Foreign learners should avoid using 
/t/ or /s/ for /8/, and /d/ or /z/ for /0/. In particular, those words 
with /0/ which are normally unaccented, e.g. the, than, they, etc. 
should not be pronounced with /d/. The difficulty of /8, 0/ lies not so 
much in their articulation, which most learners can perform correctly in 
isolation, as in their combination with other fricatives, especially /s/ 
and /z/. Learners should, therefore, practise with drills containing 
such combinations involving rapid tongue glides, e.g. /s + 0/ this 
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thing, sixth, /z + 8/ his thumb, /s + 0/ pass the salt, /z + 0/ is this it?, 
/0 + s/ fifths, /8 + s + 0/ Smith’s there, /Ó + z + 0/ soothes them, 
etc., /s, z/ preceding /8, 0/ should never be allowed to assimilate to /0, 
0/ (see §11.5). 


8.4.4 Alveolar Fricatives 
1S, 27 
(1) Examples 


/s/—fortis (Spelt $, ss, €, SC, X (= /ks/), e.g. “so, pass, niece, science, 

axe’) 

word initial—cease, sat, sample, soon, soap, sign, soil 

word medial—pieces, losses, essay, axes, concert, escape, pencil, 
whisper, wrestler, excite, useless 

word final—niece, farce, pass, puss, goose, famous, dose, ice, 
mouse, fierce, scarce 

in word initial clusters—/sp/ spare, /st/ stain, /sk/ scarce, /sm/ 
smoke, /sn/ snake, /sl/ slow, /sf/ sphere, /sw/ swear, /sj/ sue, 
/spl/ splice, /spr/ spray, /spj/ spume, /str/ stray, /stj/ stew, 
/skr/ scream, /Skj/ skewer, /skw/ square 

in word final clusters—/sp(s, t)/, gasp(s, ed), /st(s)/ rest(s), /sk(s, t)/ 
ask(s, ed), /sm/ lissom, /sn(z)/ listen(s), /sns/ licence, /sl(z)/ 
muscle(s), /ns(t)/ mince(d), /nsl(z)/ pencil(s), /Ist/ whilst, /snt/ 
decent, /ps(t)/ lapse(d), /mps(t)/ glimpse(d), /Ips/ helps, /ts/ 
cats, /kts/ acts, /pts/ opts, /Its/ faults, /nts/ tents, /Is/ pulse, 
/fts/ drafts, /ks(t)/ tax(ed), /nks/ thanks, /Iks/ milks, /mfs/ 
nymphs, /fs/ laughs, /8s/ fourths, /f@s/ fifths, /If6s/ twelfths, 
/n8s/ months, /ks8(s)/ sixth(s), /t@s/ eighths, /dst/ midst, /mpts/ 
prompts, /stl(z)/ pistol(s), /tns/ pittance, /dns/ riddance, /vns/ 
grievance, /fns/ patience, etc. 





FIG. 36.—/s, z/. 
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/z/—lenis (spelt $, ss, Z, ZZ, X (= /gZ/), e.g. ‘roses, Scissors, ZOO, dizzy, 

exact’) 

word initial—zeal, zest, zinc, ZOO, zone, zero 

word medial—easy, hesitate, bazaar, bosom, hawser, lazy, 
thousand, palsy, pansy, husband 

word final—fees, is, says, as, was, 00ze, does, butchers, gaze, rose, 
cows, noise, ears, airs, tours 

in word final clusters '—/bz/ ribs, /dz/ heads, /gz/ legs, /mz/ limbs, 
/nz(d)/ cleanse(d), /ndz/ hands, /nz/ rings, /lz/ holes, /vs/ caves, 
/\dz/ holds, /lvz/ valves, /lbz/ bulbs, /lmz/ films, /Inz/ kilns 
/0z/ clothes, /zm(z)/ prism(s), /zn(z, d)/ emprison(s, ed), /zl(z, 
d)/ puzzle(s, d), /zd/ raised, /zml/ dismal, /plz/ apples, /blz/ 
bubbles /tlz/ battles, /dlz/ saddles, /klz/ buckles, /glz/ eagles, 
/tflz/ Rachel's, /d3lz/ cudgels, /mlz/ camels, /nlz/ channels, 
/0lz/ Ethel’s, /slz/ thistles, /vlz/ evils, /flz/ ruffles, /flz/ 
officials, /tnz/ kittens, /dnz/ saddens, /fnz/ orphans, /vnz/ 
ovens, /snz/ hastens, /fnz/ oceans, /3nz/ visions, /Onz/ heathens, 
/znt/ present, /mplz/ samples, /mblz/ symbols, /zndz/ 
thousands, /ndlz/ sandals, /ntlz/ lentils, /nklz/ uncles, /nglz/ 
angles, /stlz/ pastels, /mzlz/ damsels, etc. 

Compare /s/, /z/—seal, zeal; sink, zinc; decease, disease; passing, 
parsing; fussy, fuzzy; racer, razor; peace, peas; loose, lose; use 
(n.), use (v.); gross, grows; place, plays; ice, eyes; house (n.), house 
(v.); scarce, scares: pence, pens; false, falls 


(2) Description.—The soft palate being raised and the nasal resona- 
tor shut off, the tip and blade of the tongue make a light contact with 
the upper alveolar ridge, and the side rims of the tongue a close contact 
with the upper side teeth. The air-stream escapes by means of a narrow 
groove in the centre of the tongue and causes friction between the 
tongue and the alveolar ridge. There is very little opening between the 
teeth. With some speakers, the tongue-tip is in contact with the lower 
teeth, so that friction is produced between the blade of the tongue and 
the alveolar ridge. For /s/ the friction 1s voiceless, whereas for /z/ there 
may be some vocal fold vibration, according to its situation (see 
§8.4(3)). The lip position will depend upon the adjacent vowel, e.g. 
spread for see, zeal, piece, bees, etc., and somewhat rounded for soon, 
zoo, loose, lose, etc. Some speakers make a light additional contact 
between the lower lip and the upper teeth, thus giving the sounds a 
secondary labio-dental quality. A lisp, i.e. a substitution of /8, 0/ for 
/s, z/ or a Strongly dentalized version of /s, z/, is a common speech 
defect or habit, despite the loss of phonemic opposition involved, 
which might be expected to exert pressure towards the maintenance of 
the dental/alveolar fricative distinction. 


1/z/ does not occur in an initial cluster apart from /zj/ in zeugma and Zeus. 
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(3) Variants.—Apart from the articulatory variants mentioned 
above, which are individual rather than social or regional, the fortis /s/ 
is often replaced in West Country speech by a weaker articulation 
approaching that of /z/. Before /r/, the approximation of the tongue 
to the alveolar ridge may be more retracted, e.g. in horse-riding, 
newsreel. Word final /s, z/ exhibit a readiness to assimilate before /f, j/ 
(see §§11.3.2, 11.3.5). 

Few RP speakers regularly maintain the distinction between /ns/ and 
/nts/ which is widespread in regional speech: e.g. distinguishing the 
final clusters in mince—mints, tense—tents, assistance—assistants, 
dance—plants, -/nts/ tending to be used in all cases. This plosive 
epenthesis, the insertion of /t/ between /n/ and /s/, results from the 
raising of the soft palate before the oral closure for /n/ is relaxed for the 
fricative /s/. Similarly, when /m/ or /n/ precedes the /s/, an 
epenthetic plosive homorganic with the nasal may occur, e.g. Samson 
/seemson/ > /seempson/, Kingston /kinston/ > /kimkston/, such 
variation being reflected in the spellings of proper names such as 
Sam(p)son and Sim(p)son.! On the other hand, /nz/ and /ndz/ are kept 
distinct by most RP speakers, e.g. in wins—winds, tens—tends, 
fines—finds, except in the most rapid speech when it is /d/ which may 
be elided. 

(4) Chief sources.—PresE /s/ derives from OE [s, ss] (soon, sun, 
kiss, mice, wasp) and from OF [s] (sudden, strange, lesson, beast, pace, 
false). 

PresE /z/ derives from OE /s/ = [z] between voiced sounds (rise, 
wise, thousand, wisdom) and from OF [z] (zeal, easy, dozen, cause). In 
addition, the weak termination [os] gives [z], on the loss of [a] by 
eModE, when following a voiced sound (/oves, dogs, lands, stones); 
where the vowel is retained—as /1/ in PresE—(e.g. after /s,-z, f, tf, 
d3/), the [z] fricative form is also used in the weak ending (passes, 
roses, rushes, touches, pages). In the same way, words like is, as, was, 
has, his, which occur most commonly in weakly accented positions, 
were pronounced with final [z] by e ModE. In French words, an earlier 
[s] occurring medially between weakly accented and strongly accented 
syllables often > [z] (resemble, possess, observe, disease, exact); but 
many exceptions occur, perhaps for reasons of analogy of the spelling 
pronunciation or because the [s] is felt to belong to a separable prefix or 
to be initial in a common word preceded by a separable prefix (e.g. 
assist, excite, dishonest, research). In words where the medial fricative, 
especially final in the prefix dis-, precedes a voiced consonant, both /s/ 
and /z/ are heard, e.g. in disdain, disgust. 

(5) Advice for foreign learners.—In many languages, especially 


! Similar epenthesis may take place in sequences of nasals + other voiceless fricatives. 
The epenthetic plosive is always homorganic with the nasal. 
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those where no dental fricatives exist, /s, z/ are articulated nearer to the 
teeth than the English varieties. Such a dentalized articulation is to be 
avoided in English because of the danger of confusion with /8, ð/ (both 
in terms of the phonemic opposition involved and of the difficulties of 
alveolar/dental clusters). The more retracted articulation for /s, z/ 
should be practised in opposition with /8, 0/ in such pairs as: sing, 
thing; sort, thought; close (v.), clothe; sees, seethe; mouse, mouth; use 
(n.), youth. On the other hand, those whose /s, z/ are often too 
retracted for English, e.g. Spaniards and Greeks, should practise 
oppositions between /s, z/ and /f, 3/: sin, shin; sort, short; lasses, 
lashes; Caesar, seizure; leased, leashed; mess, mesh. In addition, 
because /s, z/ occur so frequently in final positions, particular 
attention should be paid to the degree of voicing in /z/ (which should 
never be allowed to become a fortis consonant) and to the length of 
sounds preceding /s/ (see $8.4(3) and the examples in (1) of this 
section). 


8.4.5 Palato-alveolar Fricatives 


1, 3/ 
(1) Examples 


/{/—fortis (spelt sh, ch, sch, s or ss before u, -ti-, -Si-, -scl-, -Cl-, -Ce-, 
e.g. 'shoe, machine, schedule, sure, assure, nation, mansion, 
mission, conscience, special, ocean’; note: x in ‘luxury’ = /kf/) 

word initial—sheet, shed, shop, sugar, charade, shout 

word medial—Asia, bishop, ashore, mission, luscious, bushel, 
cushion, rashly, machine 

word final—dish, cash, wash, push, douche, rush, finish, ruche 

in word initial clusters '—/fr/ shrink 

in word final clusters—/1f(t)/ welsh(ed), /fn(z, d)/, fashion(s, ed), 
/{nt(s)/ patient(s), /nfn(z, d)/* mention(s, ed), /ft/ pushed, /fl(z, 
d)/ marshal(s, led); where /n/ precedes final /tf/, e.g. in bench, 
lunch, some speakers use a final cluster /nf/, without the [t] stop. 

/3/—lenis (spelt -si-, s, z before u and, in French loan words, final 
-2£€, e.g. ‘vision, measure, seizure, beige’) 

word initial—(in French loan words) gigolo, gigue, jabot, genre 

word medial—pleasure, leisure, usual, confusion, decision 

word final (only in French loan words; an alternative pronuncia- 
tion with /d3/ is possible)—prestige, barrage, rouge, beige 


!/tf/, /d3/, having been treated as single complex phonemic entities, are not con- 
sidered here as initial or final clusters. | 
2 An epenthetic /t/ may be inserted, thus /mentfn(z, d)/. See $8.4.4(3). 
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Fic. 37.—/f, 3/. 


in word initial clusters '—does not occur 
in word final clusters—/3n(z)/ vision(s); in French words, when said 
with final /3/, the cluster /3d/ is possible, e.g. in camouflaged. 
Moreover, for those who use final /3/, rather than /d3/, after /n/, 
e.g. In arrange(d), the clusters /n3(d)/ may occur. 
Compare /5/, /tf/—sheep, cheap; shore, chore; shoes, choose: leash, 
leech; dish, ditch; wash, watch 
/3/, /d3/—leisure, ledger; vision, pigeon 
/5/, /3/—Aleutian (when pronounced /o'lju:fn/), allusion; 
Confucian (when pronounced /kon fju:fn/, confusion 


(2) Description.—The soft palate being raised and the nasal resona- 
tor shut off, the tip and blade of the tongue make a light contact with 
the alveolar ridge, the front of the tongue being raised at the same time 
in the direction of the hard palate and the side rims of the tongue being 
in contact with the upper side teeth. The escape of air is diffuse (com- 
pared with that of /s, z/), the friction occurring between a more exten- 
sive area of the tongue and the roof of the mouth. The articulation is 
also laxer than that of /s, z/. The palatalization effect (i.e. the [1]-ness 
caused by the raising of the front of the tongue) is less marked than in 
sounds of the [f, 3] type in some other languages, indicating either that 
the front raising is less close or that the tongue as a whole is slightly 
more retracted. In the case of /f/, the friction is voiceless, whereas for 
/3/ there may be some vocal fold vibration according to its situation 
(see §8.4(3)). Some speakers use slight lip-rounding for /f, 3/1n all posi- 
tions, for others, lip-rounding is an effect of the adjacent vowel, e.g /f/ 
of shoe tends to be lip-rounded whereas /J/ of she has neutral or spread 
lips. 

(3) Variants.—Apart from the degree of palatalization or lip- 


1 See footnote on previous page. 
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rounding used, no important articulatory variants occur. Medially 
in certain words, however, /f, 3/ are not used by all speakers: 
(a) especially before /u:/ or /u/, there is often variation between /f, 3/ 
and /s, z/ + /j/, e.g. in issue, sexual, tissue, inertia, seizure, casual, 
usual, azure (cf. assume with /sj/ and assure with /f/); (b) a similar 
variation between /f/ or /s/ + /1/ + vowel, e.g. ratio, appreciate, 
negotiate; (cC) the sequence /s, z/ + /1/ (or /j/) + vowel is regularly 
kept in such words as hosier, axiom, gymnasium, Parisian (note 
American English /parizn/) and especially before the /a/ comparative 
adjectival inflexion, e.g. easier, lazier; (d) the lack of words 
distinguishable by /f/ and /3/ results in possible alternations between 
/{/ and /3/, e.g. in Asia, Persia, transition, version. In word final 
position, where /3/ exists only in comparatively recent French loan- 
words, e.g. beige, rouge, prestige, garage, etc., a variant with /d3/ is 
always possible and is felt to be the fully anglicized form. It will be seen 
that /3/, rare initially in a word, replaceable by /d3/ finally and some- 
times varying with /f/ medially, has a particularly weak “functional 
load’ in English. 

(4) Chief sources.—PresE /f/ derives from OE [f] < earlier OE 
[s] + [k] or [c] (ship, shadow, bishop, fish, English); from OF pala- 
talized [s] (cushion, cash, radish, finish); from a coalescence of /s/ + 
/3/ or /1/ finalized in the seventeenth century (sure, sugar, ambition, 
ocean, special, patient)—note that in the seventeenth and eighteenth 
centuries, several words at present with /sju:/ were pronounced with 
/5/ (suit, supreme, assume); from French /f/ in words adopted after 
earlier French /tf/ had given /f/ (chemise, chic, machine, charlatan, 
moustache). 

PresE /3/ derives from a coalescence of /z/ + /j/ or /1/ finalized in 
the seventeenth century (occasion, measure, treasure, usual), /3/ being 
considered for a long time as a foreign sound; from French /3/ in words 
adopted after earlier French /d3/ had given /3/ (prestige, rouge, belge, 
bijou)—many of these words still retaining a foreign flavour. 

(5) Advice to foreign learners.—Many languages do not possess 
palato-alveolar fricatives and others have fricatives in this region but of 
a more strongly palatalized kind. In these cases, learners should avoid 
using sound sequences which will strike the English ear as /s/ or /z/ + 
/3/5 a Slight retraction of the tongue will often suitably ‘darken’ the 
quality of the friction. Attention should also be paid to the maintenance 
of the lenis character of /3/ (with voicing where appropriate), despite 
the fact that oppositions between /f/ and /3/ are rare, except in the 
affricate complexes /tf/ and /d3/. 
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8.4.6 Glottal Fricative 
/h/ 
(1) Examples 


/h/— fortis (spelt h, wh, e.g. ‘how, hat, who, whom’) 
word initial—heat, hen, ham, hot, horse, who, hate, hoe, high, 
how, here, hair, hallo 
word medial—ahead, behave, perhaps, behind, spearhead, anyhow, 
manhood, abhor, adhere 
Compare /h/ + vowel, initial vowel—heat, eat; hill, ill; hedge, edge; 
harbour, arbor; haul, all; hate, eight; hold, old; hear, ear 


(Note: h is not pronounced initially in hour, honest, honour, hetr, 
heiress; medially in such words as exhaust, exhilarate, exhibit, vehicle, 
vehement; and in some final suffixes, e.g. shepherd, Durham, 
Clapham, etc.) 


(2) Description.—Since English /h/ occurs only in syllable initial, 
pre-vocalic positions, it may be regarded as a strong, voiceless onset of 
the vowel in question. The air is expelled from the lungs with con- 
siderable pressure, causing some friction throughout the vocal tract, 
the upper part of which is shaped in readiness for the articulation of the 
following vowel (i.e. as regards the position of the tongue, lips, soft 
palate, and the configuration of the pharynx). Thus differing types of 
friction (patterns of resonance) will be heard for /h/ in the sequences 
/hi:/, /ha:/, /hu:/. The friction being, therefore, largely of the mouth 
cavity type associated with the nature of the following vowel, this 
syllable-marginal sound is considered by many writers to share notable 
characteristics with vowel (vocoid) articulations and is interpreted 
phonemically as being in complementary distribution with the second 
element of the long vowels /a:,3:, 3:/ and the diphthongs /19, €3, u9/. 
Since, however, the common feature of all types of pre-vocalic /h/ is 
the passage of a strong, voiceless air-stream through the open glottis, 
the sound is here referred to as a fortis, voiceless, glottal fricative. With 
the onset of the vocal fold vibration of the vowel, the air-pressure is 
reduced. There is no distinctive fortis/lenis opposition such as 
characterizes the other English fricatives. 

(3) Variants. —Although /h/ functions in English essentially as a 
voiceless syllable-initial margin (in the same way that /n/ occurs only in 
post-syllabic positions), a few speakers use a voiced (or slightly voiced) 
allophone medially bctween voiced sounds, e.g. in such words as 
anyhow, perhaps, behind. In such pronunciations, the strong air- 
stream of /h/ is accompanied by vocal fold vibration, the rcsult being a 
kind of breathy vowel or voiced glottal fricative [fi]. 

Jn many types of popular regional spcech, /h/ is lost, so that no 
distinction is made between such RP minimal pairs as hill, ill, high, eye; 
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hair, air. Usually in such speech, the /h/ words will behave as if they 
had an initial vowel, e.g. a hill /an '11/, the house /d1 'aus/, but some- 
times a trace of the boundary marking function of /h/ will be shown in 
the use of [?], or at least a weak glottal constriction, e.g. a hill [a l], the 
hospital [a ':vspitl]. On the other hand, the usual linking forms 
between article and word with initial vowel may be replaced by a glottal 
constriction or weak [h], e.g. an egg [oeg] or [o 'teg], the end [da '"end] 
or [a end]. 

Such loss of /h/ is usually considered characteristic of uneducated 
speech, but certain form words (especially have, has, had, pronouns 
and pronominal adjectives) frequently lose /h/ in RP in unaccented, 
non-initial, situations in connected speech (see §10.4), e.g. he pushed 
him on his back /hi: ‘puft ım bn ız beek/, I could have hit her /at kad av 
Mit 9/. 

Some RP speakers treat an unaccented /-syllable, as in historical, as 
if it belonged to the special group hour, honest, etc., i.e. without an 
initial /h/, e.g. an historical novel. Such pronunciations, and also that 
of humour as /ju:mo/, are nowadays used only by a minority of 
speakers. In the case of hotel, however, an /h/-less form is fairly widely 
spread, e.g. an hotel, though the pronunciation with initial /h/ 1s 
commoner. (For /h/ + /], w/, see §8.9.) 

(4) Chief sources. —OE had /h/ (= [h], [x], or [ç]), not only initially 
in a syllable before a vowel, but also in initial clusters, e.g. /hn/ 
(hnutu), /hi/ (hlaf), /hr/ (hréosan), /hw/ (hwéol), where the realiza- 
tions may have been [n, I, r, w] or [xn, xl, xr, xw]; also in medial and 
post-vocalic positions, with palatal or velar fricative realizations, e.g. 
hliehhan, áhte, héah. Of these occurrences, PresE retains only initial 
/h/ before a vowel (see §8.9.2 for /hw/), e.g. home, high, help, horse, 
etc. In addition, a new initial /h/ was introduced into English in some 
French words of Germanic origin, e.g. hardy, haste, herald, and also in 
a large number of French words of Latin origin, e.g. herb, horror, 
habit, harass, heretic, hospital, host, humour, etc. In the case of this 
latter group, spelling with A both in OF and ME was erratic, probably 
representing an /h/-less pronunciation. In certain words (hour, heir, 
heiress, honour, honest), the /h/-less pronunciation has been kept 
despite the general adoption of the spelling A. In most other cases, the 
letter 4 began to be pronounced in late ME in these words of Latin (via 
French) origin, though in some the spelling A was not sounded as late as 
the eighteenth and nineteenth centuries, e.g. herb, hospital, humble, 
humour. 

The general elision of /h/ in weak pronouns and auxiliary verbs 
probably dates at least from eModE, whereas the loss of /h/ in accented 
words, as in modern popular London speech, has existed for at least 
two centuries and has always been considered a vulgarism. 

(5) Advice to foreign learners.—Many languages do not possess a 
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phoneme of the /h/ type. Speakers of these languages should, in 
learning English, practise the examples given in (1) above, making a 
correct distinction between words with initial /h/ + vowel and those 
with initial vowel, e.g. hill, il, etc.; they should also learn to elide the 
/h/ of he, him, his, her, have, had, has, when these words occur in 
weakly accented, non-initial positions in the utterance. 

Those learners who in their own language have no [h] but [x] or who 
have an [h] said with’some velar friction, e.g. in some Slav and Chinese 
languages, should avoid using any velar friction in English, but should 
practise the English /h/ as a voiceless onset to the following vowel. 


8.5 Fortis and Lenis as Phonological Categories 


It will be seen from the preceding sections that in various ways the 
members of the class /p, t, k, f, 8, s, f, tf/ behave similarly to each other 
and differently from their counterparts in the class /b, d, g, v, 0, Z, 3, 
d3/. Itis not primarily in the matter of relative ‘strength’ of articulation 
but in their behaviour that the members of each pair differ. Thus, to 
summarize, (i) members of the fortis series are voiceless inter- 
vocalically whilst those of the lenis series are voiced in the same environ- 
ment; (11) the fortis plosives /p, t, k/ are aspirated (and voiceless) when 
initial in a stressed syllable, whilst their lenis counterparts /b, d, g/ are 
unaspirated (and considerably devoiced if initial in an utterance or if 
following a voiceless consonant); (iii) all the fortis consonants cause 
pre-fortis clipping whilst the vowels, nasals and laterals preceding 
members of the lenis series display no such reduction in duration. For 
this reason it is generally regarded as most useful to treat the fortis/lenis 
distinction as primarily a phonological classification which accounts 
for a complex of realizational features. 


Class B 
Voiced Non-fricative or Glide Consonants 
(Sonorants) 


Although voiceless fricative allophones of the following consonants 
occur, their most common realizations are voiced and non-fricative. 


8.6 Nasals 


(1) Articulatory features.—Nasal consonants resemble oral plosives in 
that a total closure is made within the mouth; they differ from such 
plosives in that the soft palate is in its lowered position, allowing an 
escape of air into the nasal cavity and giving the sound the special 
resonance provided by the naso-pharyngeal cavity. Since the air-stream 
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may escape freely through the nose, nasal consonants are continuants; 
they differ, however, from continuants such as fricatives in that no 
audible friction is produced and from the fact that they are usually 
voiced, without significant fortis/lenis or voiced/voiceless oppositions. 
In many respects, therefore, being normally frictionless continuants, 
they resemble vowel-type sounds. 

(2) Acoustic features.—The usual voiced nasal consonant has no 
noise component such as results from the burst of plosives or the 
friction of fricatives, nor the silence gap associated with plosives; they 
present, on the contrary, like vowels, a regular periodic wave pattern 
during their steady state with a characteristic formant structure and the 
periodic component producing voice. The soft walls of the added nasal 
cavity entail very considerable damping (especially in the 1 ,000—2,300 
cps band) and broadening of the resonance band-widths. The best 
responses for the various nasal consonants have been achieved 
synthetically! with the following formant structures: /m/, F1 200 cps, 
F2 1,100 cps, F3 2,500 cps; /n/, F1 200 cps, F2 1,700 cps, F3 2,500 cps; 
/n/, F1 200 cps, F2 2,300 cps, F3 2,500 cps, the situation of F2 being 
the variable characteristic. This formant may, however, be severely 
mutilated in the nasals of actual speech, so that strong cues for distinc- 
tion among types of nasal appear to be provided’ by characteristic 
curves (transitions) of F2 of the adjacent vowel—comparable with 
those characterizing equivalent plosive articulations, e.g. a bilabial 
nasal [m] shows, like [b], extensive minus transitions; an alveolar nasal 
[n], like [d], plus or minus transitions; and a velar nasal [n], like [g}, 
plus transitions. 

(3) The English nasal consonants.—(a) Three nasal phonemes cor- 
respond to the three oral plosive areas of articulation: bilabial /m/-/p, 
b/; alveolar /n/-/t, d/; velar /n/-/k, g/. If, in the articulation of a 
nasal consonant, the nasal passage is blocked as, for instance, often 
happens during a cold, /m, n, n/ will be realized as /b, d, g/, e.g. 
morning /'bo:dig/, some nice lemons /sab ‘dais 'lebadz/. Oppositions 
amongst the nasals may be illustrated as follows:— 


Bilabial /m/ Alveolar /n/ Velar /n/ 
Initial might night = 
Medial simmer sinner singer 
Final sum sun sung 


It will be seen that, since /n/ does not occur initially in a word or 


l! See Kazuro Nakata, ‘Synthesis and perception of nasal consonants,’ J/.A.S.A., 31, 6, 
1959. 

2See Liberman, Delattre, Cooper, Gerstman, ‘The rôle of consonant-vowel transitions 
in the perception of the stop and nasal consonants, Psychological Monograph 379, 1954; 
also A. Malécot, ‘Acoustic cues for nasal consonants,’ Language, vol. 32, 2, 1956. 
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morpheme, a complete series of oppositions is found only where the 
nasals occur in post-vocalic positions in the same syllable or morpheme. 

(b) The vocalic nature of the nasals is underlined by the fact that they 
readily perform the syllabic function of vowels: most often /n/, e.g. 
mutton [matn]; less commonly /m/ e.g. rhythm [ridm]); occasionally, 
with some speakers, /n/, e.g. bacon [‘beikn). 

(c) Although no opposition occurs between voiced and voiceless 
nasals in English, a somewhat devoiced allophone of /m/ and /n/ may 
be heard when a voiceless consonant precedes, e.g. smoke, smart, top- 
most; snake, sneeze, chutney. The distribution of /n/ being restricted, 
it is only rarely—in a syllabic situation as in bacon—that a voiceless 
consonant precedes, with the consequent partial devoicing. 


8.6.1 Bilabial Nasal 
/m/ 
(1) Examples 


(regularly spelt with m; mm, e.g. ‘meat, summer’; sometimes mb, 
mn, e.g. ‘comb, autumn’) 
word initial—meal, mat, march, move, mirth, make, mouse 
following word initial /s/—smack, smock, smite, smoke, smear 
word medial—demon, glimmer, lemon, salmon, Thomas, among, 
gloomy, summer, sermon, commit, omen; jumper, timber, empty, 
comfort, hamlet, simple, symbol, dismal, camel, dimly, asthma 
word final—seem, lamb, harm, warm, tomb; game; (in final clusters) 
worms, harmed, film(s), warmth, glimpse, prompt(s), nymph(s); 
(syllabic) rhythm(s), prism(s), lissom. 


(2) Description.—The lips form a closure as for /p, b/; the soft 
palate is lowered, adding the resonance of the nasal cavity to those of 
the pharynx and the mouth chamber closed by the lips; the tongue will 





FIG. 38.—/m/. 
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generally anticipate or retain the position of the adjacent vowel or /l/. 
Except when partially devoiced by a preceding voiceless consonant, e.g. 
initially—smoke, medially—topmost, finally—happen, /m/ is voiced. 
(Normal expiration through the nose with the lips closed may be 
described as a weak [m]; where, because of some organic defect such as 
cleft palate, the nasal cavity cannot be shut off, /p/ may be realized as 
[m] and /b/ as [m].) When followed by a labio-dental sound /f, v/, the 
front closure may be labio-dental [m] rather than bilabial, e.g. in 
nymph, comfort, triumph, come first, circumvent, warm vest (see also 
[m] for /n/, 8.6.2(2)). 

/m/ frequently results in context from a final /n/ of the isolate word 
form before a following bilabial, e.g. one mile /'wam ‘mail/, more and 
more /'mo:r am 'mo:/, ten pairs /'tem ‘peaz/, gone back /'gom 'bek/; 
sometimes /m/ is a realization of word final /an/ or /n/ following /p/ 
or /b/, e.g. happen /'heepm/, ribbon /'ribm/, or, in context, cap and 
gown / keep m 'gaun/ (see $11.3.5). 

(3) Variants.—There are no important regional or social variants of 
/m/ articulations. 

(4) Chief sources. —PresE /m/ derives from OE [m], [mm] (man, 
hammer, swim, home)—note [b] of [mb] was lost in word final 
positions in ME in such words as climb, lamb, though kept medially as 
in timber, or inserted as in thimble, slumber, bramble, before /1/ and 
earlier /r/; from OF [m] (master, family, solemn, sum). 

(5) Advice to foreign learners.—This phoneme should present no 
difficulty. 


8.6.2 Alveolar Nasal 
7 


(1) Examples 


(regularly spelt with n, nn, e.g. ‘now, noon, funny’; or kn, gn, pn, 
e.g. ‘know, gnaw, sign, pneumonia’; also /bn/ as a realization of 
French /@/, e.g. ‘rendez-vous’) 

word initial—neat, knit, net, gnat, knot, gnaw, none, nurse, name, 
know, near 

following word initial /s/—sneeze, snatch, snore, snug, snake, 
snow, sneer 

word medial—dinner, many, hornet, monitor, annoy; wonder, 
hunter, unless, unrest, answer, pansy, infant, invoice; chutney, 
madness, amnesty, walnut, fastener, evening 

word final—mean, pen, gone, soon, learn, melon, down, coin; 
pint(s), pond(s), inch, hinge, final(s), pence, pens, month(s), 
kiln(s), rental(s), bundle(s), pencil(s), against 

syllabic /n/—*open, *ribbon, *sicken, *organ (*more commonly 
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with /-on/), cotton, sudden, often, oven, earthen, southern, listen, 
dozen, mission, vision; maddening (or with non-syllabic /n/), 
reasonable (or with non-syllabic /n/ or /on/), ordinary (or with 
non-syllabic /n/ or /an/ or /in/), southerner (or with /on/) 


(2) Description.—The tongue forms a closure with the teeth ridge 
and upper side teeth as for /t, d/; the soft palate is lowered, adding the 
resonance of the nasal cavity to those of the pharynx and of that part of 
the mouth chamber behind the alveolar closure; the lip position will 
depend upon that of adjacent vowels, e.g. spread lips in neat, keen; 
neutrally open lips in snarl, barn; somewhat rounded lips in noon, 
soon. Except when partially devoiced by a preceding voiceless con- 
sonant, e.g. initlally—snug, medially—chutney, finally—cotton, /n/ is 
voiced. (Where, because of an organic defect such as cleft palate, the 
nasal cavity cannot be shut off, /t/ may be realized as [n] and /d/ as 
[n].) The articulation of /n/ is particularly liable to be influenced by 
that of the following consonant, e.g. when followed by a labio-dental 
sound /f, v/, as in infant, invoice, on fire, in vain, /n/ may be realized 
as [m]—thus overlapping with realizations of the /m/ phoneme (see 
8.6.1); /n/ before dental sounds /8, ð/ is realized with a lingua-dental 
closure [n], as in tenth, when they—and sometimes when following 
/8, O/ (earthen, southern); before /r/, /n/ may have a post-alveolar 
contact, as in unrest, Henry; in addition, in context, word final /n/ 
frequently assimilates to a following word initial bilabial or velar con- 
sonant, being realized as /m/ or /n/, e.g. ten people, ten boys, ten men, 
where the final /n/ of ten may assimilate to /m/, and ten cups, ten girls, 
where the final /n/ of ten may assimilate to /n/ (see $11.3.4). 

(3) Variants.—There are no important regional or social variants of 
/n/ articulations. 

(4) Chief sources.—PresE /n/ derives from OE [n, nn] (name, 
many, man, spin); from OE [hn], [xn] or [n] (mut)—by the twelfth 
century; from OE [kn, gn] (know, knit, gnaw, gnat)—by the late seven- 





Fic. 39.—/n/. 
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teenth century; from OF [n] (noble, dinner, plain); from OF palatal 
[n] (sign, reign, line, mountain, onion). Note that sometimes earlier 
[n] > [nt, nd] (pheasant, tyrant, ancient, sound, pound); and that 
sometimes [n] of the indefinite article has become affixed to a noun 
beginning with a vowel (newt, nickname), or initial [n] of a noun has 
been lost through transference to the preceding indefinite article form 
(adder, apron, umpire). | 

(S) Advice to foreign learners. —Care should be taken that /n/, like 
/t, d/, is normally given an alveolar rather than a dental articulation 
(see practice examples in (1) above). 


8.6.3 Velar Nasal 
/n/ 
(1) Examples 


(regularly spelt ng—or n followed by a letter indicating a velar con- 
sonant—e.g. ‘sing, tongue, sink, anxious, uncle’; also /pn/ as a 
realization of French /4/, e.g. ‘restaurant’; /n/ occurs usually 
after the short vowels /1, Æ, D, A/; rarely after /e/) 
word medial—singer, hanger, longing, anxiety 
word medial + /g/—finger, anger, angry, hunger, strongest, lan- 
guage, single, angle, England, bungle, nightingale 
word medial + /k/—tinker, anchor, banquet, monkey, donkey, 
conquer, wrinkle, ankle, uncle, anxious 
word final—sing(s), hang(s, ed), wrong(s, ed), tongue(s, ed), among 
word final + /k/—sink(s), rank(s, ed), conch(sometimes/kpnk/), 
chunk(s), monk(s), distinct, amongst (or with /nst/), strength 
(with /9k0/, /n8/, or sometimes /n8/) 
word final syllable—(occasionally) bacon, taken, thicken, blacken, 
organ 
/y/, or less commonly /n/, in such words as—income, conclude, 
encourage, concrete, bronchitis, engage, enquiry 
Compare /n/, /n/—sing, sin; rang, ran; hanged, hand; sung, sun; 
mounting, mountain; gong, gone; robbing, robin 
/y/, /ynk/—thing, think; rang, rank; sung, sunk; singing, 
sinking; hanger, hanker 


(2) Description.—A closure is formed in the mouth between the 
back of the tongue and the velum as for /k, g/ (the point of closure will 
depend on the type of vowel preceding, the contact being more 
advanced in sing than in song); the soft palate is lowered, adding the 
resonance of the nasal cavity to that of the pharynx and that small part 
of the mouth chamber behind the velar closure; the lip position will 
depend upon that of the preceding vowel, being somewhat spread in 
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sing and relatively open in song. /n/ is normally voiced, except for 
partial devoicing in the possible, though less common, case of syllabic 
/y/ in such words as bacon, thicken. (Where, because of an organic 
defect such as cleft palate, the nasal cavity cannot be shut off, /k/ may 
be realized as /n] and /g/ as [n].) Word final /n/ may result in context 
from the isolate word final form /n/, e.g. ten cups (see §§8.6.2(2), 
11.3.4). 

(3) Variants.—Earlier [ng] forms are retained, instead of RP /p/, in 
many regional types of speech, notably in the Midlands and north of 
England, e.g. singing [singing] for RP /'sinm/. If /g/ is always pro- 
nounced in such situations, [p] must be counted an allophone of /n/ 
rather than of a separate phoneme. l 

In some forms of regional speech, and also of some conservative RP, 
where /n/ is a phoneme (sin, sing, being distinguished merely by the 
type of final nasal), the termination -ing is pronounced /m/, as in 
meeting, nothing, thus producing homophones of the type robbing, 
robin. 

In popular London speech, in which /n/ is phonemic (cf. sin, sing), 
the word -thing in compounds is often pronounced /-fink/, e.g. in 
something, anything, nothing; the verbal termination -ing may be /-1n/ 
or /-in/ without /k/. 

(4) Chief sources. —PresE /p/ occurs in words of OE origin (sing, 
hunger, thank, tongue) or of OF origin (frank, rank, conquer, 
language), following a short vowel. So long as [n] occurred only before 
/k,g/, the nasal was an allophone of /n/. But with the loss of /g/ in the 
sequence [ng], /n/ gained a phonemic significance, based on opposi- 
tions of the sing, sin type. Although loss of /g/ in [ng] may have 
occurred in popular dialects in late ME, it seems that the plosive (and 
allophonic nature of [n]) was retained in educated speech until the end 
of the sixteenth century. At first, in the late sixteenth and seventeenth 
centuries, /g/ was elided only before consonants, the loss of /g/ in final 
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[ng] before a vowel or a pause becoming general during the seventeenth 
century. 

The pronunciation /-in/ for the termination -/ng, common in the six- 
teenth and seventeenth centuries, became a general feature of fashion- 
able speech in the eighteenth century and has been retained to-day as a 
characteristic of an archaic form of RP (e.g. in huntin”, shootin’,-and 
fishin’). It is to be noted that the sequence [nk] has not lost the plosive 
element and [n] in such a situation could still be treated as an allophone 
of /n/, were it not for the emergence of /n/ as a phoneme as a result of 
the reduction of [ng] > /n/. The very restricted distribution of /n/ in 
PresE is, therefore, imposed by the circumstances of its development. 

(5) Advice to foreign learners.—Care should be taken to articulate a 
velar sound (i.e. one related to /g/) rather than a palatal nasal of the [y] 
type. 

Those learners whose own language has [n] only as an allophone of 
/n/ before /k, g/ should avoid using /g/ (or more rarely [k]) in those 
cases where /n/ occurs in English without a following plosive, 
especially in sequences where final /n/ is followed by a vowel, e.g. in 
singing, reading out, along essay. In practising, the nasal /n/ should be 
given exaggerated length and sequences such as /n1:, na:/ repeated in 
order to obtain a succession of nasal + vowel without a plosive. 


8.7 Laterals 


(1) Articulatory features.—A lateral consonant is articulated by means 
of a partial closure, on one or both sides of which the air-stream is able 
to escape through the mouth. In English, the tip of the tongue arti- 
culates normally with the centre of the upper teeth ridge; if the air 
escape is unilateral, an occlusion will also be made on one side between 
the rim of the tongue and the upper side teeth. English lateral sounds 
are usually voiced and frictionless, falling into the same category of 
voiced continuants (sonorants) as the nasals and /r/ and, to a lesser 
extent, /], w/ (see following sections). They are, therefore, in many 
respects vowel-like, and often perform as vowels, 1.e. have a central 
syllabic function. Most commonly, however, they behave as con- 
sonants, 1.e. have a non-central situation in the syllable and, in the case 
of the voiceless fricative allophone [l], are consonantal from a purely 
phonetic standpoint. 

(2) Acoustic features.'—The more usual types of English voiced 
lateral continuant have in their steady state a three-formant structure 


1See O'Connor, Gerstman, Liberman, Delattre, and Cooper, ‘Acoustic cues for the 
perception of initial /w, j, r, 1/ in English,’ Word, vol. 13, no. 1, 1957; Leigh Lisker, 
‘Minimal cues for separating /w, J, r, 1/ in intervocalic positions,’ Word, vol. 13, no. 2, 
1957. 
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together with a fundamental voice component comparable with that of 
vowels. A steady state of from 50 to 60 msecs duration is needed if the 
continuant nature of the consonant is to be perceived, whereas a longer 
duration produces an effect of syllabicity. In cases of adjacent lateral 
and vowel, however, transitional cues are important in distinguishing 
[I] sounds from other continuants or glides such as /r, w, J/. Fl of /I/ is 
situated at or below Fl of the adjacent vowel, but not lower than 360 
cps; F2 is within the range 960—1,800 cps before vowels of the [1, e, e] 
type, within 840—1,800 cps for [a], and within 840—1,680 cps for such 
vowels as [9, o, u] (in all cases depending on the F2 of the vowel); F3, 
crucial for distinguishing /l/ from /r/, is close to that of the vowel F3 
before [i, e, e] and as low as 1,920 cps before [9, o, u]. Impressions of 
‘clear’ and ‘dark’ qualities in the English lateral sounds are determined 
by the frequency of F2, the lower the frequency the ‘darker’ the [l] 
sound. The actual duration of the transition is also important—too 
short a transition risking confusion with nasals and stops, whereas a 
long transition will suggest a vowel glide. For [1] sounds, F2 and F3 have 
the best transition-durations at about 60—70 msecs; Fl transitions have 
to be much more rapid—of the order of 10 msecs. 


8.7.1 The English lateral phoneme /\/.—Only one alveolar, lateral 
phoneme occurs in English, there being no opposition between fortis 
and lenis, voiced and voiceless, or fricative and non-fricative. Within 
the /1/ phoneme three main allophones occur:— 

(a) Clear [1], with a relatively front vowel resonance, before vowels 
and /j/; i 

(b) Voiceless [1], following accented (aspirated) /p, k/ (less consider- 
able devoicing occurs after /s, f, 8, f/ or weakly accented /p, t, k/ 
—marked with * in the examples below); the tongue position is as for 
dark [t] when syllabic but otherwise as for clear [1]; 

(c) Dark [t], witha relatively back vowel resonance, finally (in a pre- 
junctural position) after a vowel, before a consonant, and as a syllabic 
sound following a consonant. 

Note that the rule for the distribution of clear and dark /l/ works 
across boundaries e.g. the /l/ in feel it is a clear [l] before the initial 
vowel of it. 


(1) Examples 


(regularly spelt /, H e.g. light, fill’; /, in pre-consonantal position 
is, however, frequently silent, e.g. ‘talk, should, half, calm, 
salmon, folk, Holborn’) 

(a) Clear ||] 

word initial—leave, let, lock, look, late, loud, leer, lewd 

in word initial clusters—blow, glad, splice, *fly, *slow 

word medial—silly, yellow, alloy, collar, caller, pulley, foolish, 
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sullen, sailor, island, oily, million, failure, allow, select; medley, 
ugly, nobly, gimlet, inlay, bachelor, specially (with [1] or [al]), 
*afflict, *asleep, *pamphlet 
word final, intervocalic in context—feel it, fall out, all over, will you 
(b) Voiceless [1] —play, clean; (less devoicing after a weakly accented 
fortis plosive) *aptly, *butler, *hopeless, *sprinkler, *couplet, 
*antler, *simplest, *ghastly 
(c) Dark [1] 
word final, after vowel—feel, fill, fell, canal, snarl, doll, call, bull, 
pool, dull, pearl, pale, pole, pile, owl, oil, royal, real, cruel 
after vowel, before consonant—help, bulb, salt, cold, milk, filch, 
bilge, film, kiln, self, solve, health, else, bills; alpine, elbow, 
halter, elder, alchemy, almost, illness, alphabet, silver, wealthy, 
although, ulcer, palsy, Welsh, always 
syllabic [1] —*apple, table, *little, middle, *buckle, eagle, *satchel, 
cudgel, camel, final, quarrel (or with [at]), *awful, oval, *Ethel, 
*parcel, easel, *special, usual, spaniel (or with [-391]), Oswald (or 
with [-watd]), equal, *simple, tumble, *mantle, fondle, *uncle, 
angle, *sinful, *pistol; doubled, tables, measles, finally (or with 
[-al-]; cf. ‘finely’ with [l]) 
variations in inflected forms of verbs having [{] in the uninflected 
form—[}], or [al] (more rarely [l])—pommelling, tunnelling, 
cudgeling [I] (more rarely [f] or [al])—fondling, doubling, 
*circling, wriggling, *settling, *coupling, *whistling, puzzling, 
*scuffling, shovelling 
It may be noted that some speakers distinguish between participles 
and verbal nouns, using clear [1] for the latter and dark [1] or [al] for the 
former; thus *coupling ['kaplin] (connecting device) and [kaptn] 
(joining). See §9.6. 


(2) Description.—The soft palate being in its raised position, 
shutting off the nasal resonator, the tip of the tongue is in contact with 
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FIG. 41.—/I/; clear [1]. FIG. 42.—/1/; dark [t]. 
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the upper teeth ridge, allowing the air to escape on both sides or, in the 
case of a unilateral tongue-rim closure on the upper side teeth, on one 
side. For clear [1], the front of the tongue is raised in the direction of the 
hard palate at the same time as the tip contact is made, thus giving a 
front vowel resonance to the consonant; this resonance is often of the 
[é] type, but may be closer or more open according to the following 
vowel, cf. leap, loop. For dark [t], the tip contact is again made on the 
teeth ridge, the front of the tongue being somewhat depressed and the 
back raised in the direction of the soft palate, giving a back vowel (or 
velarized) resonance; this resonance is often of the type [6] or [x]. The 
lips’ position is influenced by the nature of the adjacent vowel, cf. /eap, 
feel (with spread lips), loop, pool (with somewhat rounded lips); in the 
case of [t], and especially [t], there is, with some speakers, always a 
tendency to lip-rounding. 

Both [1] and [t] are voiced, though partial devoicing may take place 
when a preceding consonant is fortis, as in those plosive (especially) and 
fricative examples marked with * in (1) (a), (c) above; after syllable 
initial, accented /p, k/, the devoicing is frequently complete. 

The actual point of contact of the tongue for [t] is conditioned by the 
place of articulation of the following consonant; thus, in health, will 
they, the [t] has a dental contact (to a lesser extent, a preceding /6, 0/ 
may cause a dental articulation [1], e.g. in a month late, with love); or, 
in already, ultra, all dry, the contact for [t] is likely to be post-alveolar. 
[t] may also be strongly nasalized by a following nasal consonant, e.g. 
elm, kiln, 

The velarization of [t] often has the effect of retracting and lowering 
slightly the articulation of a preceding front vowel, e.g. feel, fill, fell, 
canal; in the case of /1:/ + [t], a central glide between the vowel and [1] 
is often noticeable, and the [1] element of /el, al, 91/ tends to be 
obscured, e.g. in pail, pile, oil. 

(3) Variants.—The RP distribution of [l] and [t] may be said to be: [l] 
when a vowel or /j/ follows; [t] in all other positions, including some 
cases where syllabic [t] occurs before a vowel. Variations in the quality 
of the back vowel resonance associated with [t] are, however, to be 
found among RP speakers, with a range extending from [6], [u], or [y] 
to [5] or [A]; lip positions, too, vary from neutral to loose rounding. In 
some speech, notably that of Cockney,' the tongue-tip contact for [tł] is 
omitted, this allophone of /1/ being realized as a vowel (vocoid) in the 
region of [6] with weak lip-rounding or as [y] with neutral or weakly 
spread lips, thus sell [sed] or [sex], fall [fod] or [fox], table ['tzeibd] or 
[‘tzeiby]. In such speech, the lowering of /i:/ and /u:/ before [6] is so 
marked that :neal, mill, and pool, pull, may become homophonous or 


'For a full phonetic and phonemic treatment of Cockney [t], see Eva Sivertsen, 
Cockney Phonology, Oslo, 1960. 
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distinguished merely by the length of central syllabic vowel element, i.e. 
[mr ö] v. [mi6], [pu'ó] v. [pu]; other confusions are likely, e.g. rail 
(RP [rert], Cockney [reeó]) and row (RP [rau], Cockney [reeó)), dole 
(RP [daut], Cockney {[dp6]) and doll (RP [dot], Cockney [doó)). 
Realizations of syllabic [1] as a vowel are not, however, confined to 
Cockney; many RP speakers will use [6] for [t] in words such as 
beautiful, careful, people, table, etc., i.e. especially when a consonant 
involving a labial articulation precedes; they will, moreover, not 
recognize such a vocalic allophone as unusual when they hear it. The 
use of a vocalic allophone seems somewhat less usual when other 
consonants precede, e.g. in uncle, Ethel, parcel, special, spaniel, and is 
particularly avoided, as a childish pronunciation, after alveolar 
plosives, e.g. little, middle, where the consonants are regularly released 
laterally. On the other hand, in an artificial or somewhat precious style 
of speaking, [l] may be used for [tł]. 

It has been seen (in (1) (a) above) that word final non-syllabic [1] 
becomes [1] when a following word begins with a vowel or /j/. In the 
same way, syllabic [1] may tend to become clearer when a vowel follows, 
e.g. finally (when pronounced [‘fainli)), acetylene (when [2 setli:n]), 
special edition, parcel of books, little or nothing, middle of the road, 
etc. The use of the clearer or the darker quality [1] may, however, be 
related to the rhythmic and sense structure of the utterance, e.g. feel it 
may have a clear [1] as in feeling, but the [1] of feel in feel ill, which it is 
possible to treat as two rhythmic groups, may be of a darker kind (see 
Abercrombie, 1965). 

In other varieties of English the RP distribution of [1] and [t] may not 
obtain. Thus, in some kinds of Scottish and American English, /l/ 
before vowels and /j/ is realized with a back vowel resonance, whereas 
in Irish English a relatively clear [1] is used in those situations where RP 
would have [t]. Note, too, that American English has syllabic [t] in 
words such as fertile, futile, missile, reptile, etc., where RP retains a 
prominent preceding vowel [-ait]; a reduction of the vowel, similar to 
the present American form, occurred in seventeenth-century British 
English. 

(4) Chief sources.—PresE /l/ derives from OE [l, 11] (and, climb, 
all, tell, apple), from OE [hl], [xl] or [1] (Loaf, ladder)—since OE front 
vowels tended to be diphthongized before /1/, it seems likely that /1/ in 
such a position was velarized in OE as it is to-day; from OF [1] (amp, 
close, colour, veal, able). In many cases pre-consonantal [tł], especially 
after back or open vowels, was vocalized to [u] (walk, talk, half, folk) in 
the early fifteenth century, such a pronunciation being commonly 
shown by the seventeenth-century grammarians; but in some cases 
(halt, salt, malt), it has been retained; in others, an /1/ has been reintro- 
duced in spelling and pronunciation (fault, falcon, emerald, soldier, 
realm) or merely in the spelling (calm, palm, balm). The loss of /I/ in 
could, should, would, occurred in eModE. 
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(5) Advice to foreign learners.—Few foreign learners will possess in 
their own language [l] and [t] sounds with the RP distribution. It is true 
that, since there is no phonemic opposition between [l] and [t] in 
English, learners will be perfectly intelligible if they use only [l]. But 
those who wish to avoid a foreign accent and whose English otherwise 
conforms to an RP pattern should learn to make the dark [+]. In the 
articulation of [t] there should be no ‘curling back’ of the tongue, as is 
so often advised in books on English. To overcome such a habit already 
acquired, the tongue-tip may be gripped between the teeth during 
practice. The essential feature of [t] may be said to be the accompanying 
weakly rounded [o] or [9] quality; learners should, therefore, begin by 
pronouncing a vowel of the [o] type for the syllabic [+] in words such as 
bubble, people, awful, i.e. where a labial consonant precedes [tł], 
thus—[babo], [pizpo], [o:fo]. A pure vowel of this kind is likely to 
occur in their own language. Such a pronunciation will come near to 
that used by many English speakers (see (3) above). The same sound 
sequence should then be attempted with the tongue-tip touching the 
upper teeth ridge, thus producing a lateral sound with the correct 
velarized quality. The relationship of [t] and [o] can further be estab- 
lished by practising the alternation of [o]-[t]-[o]-[t], only the tongue-tip 
moving and the [o] resonance continuing. The [tł] thus achieved should 
then be used in the examples of [t] given in (1) above, first the syllabic 
cases and then the non-syllabic. 

Care should also be taken to use a sufficiently devoiced [l] after 
accented (aspirated) /p, k/. Accented /p, k/ are distinguished from /b, 
g/ mainly by their aspiration; 1t is important that this aspiration-cue 
should be made clear in the sequences /pl, kl/ by the voicelessness of the 
/1/. If this is not done, such a word as plot, pronounced with a fully 
voiced /1/, may be understood as blot. Pairs for practice, relying largely 
for the opposition on voiceless v. voiced [l], are: plot, blot; plead, 
bleed; plight, blight; clad, glad; class, glass; clean, glean; clue, glue. 

Some learners, especially Jugoslavs and some others speaking Slav 
languages, should avoid using a lateral in pre-vocalic positions which is 
over-velarized for RP. 


8.8 Post-alveolar Approximant (Frictionless Continuant) 
ws 
(1) Examples 
(regularly spelt r, rr e.g. ‘red, carry’; also wr, rh, e.g. ‘write, 
rhythm’) 
word initial—reed, rag, raw, rude, rut, road, royal, rear 


word medial, intervocalic—mirror, very, arrow, sorry, hurry, furry, 
arrive, diary, dowry, dairy, eery, fury 
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word final (/r/-link with following word beginning with a vowel)— 
far away, poor old man, once for all, here at last, there are two 

in consonantal clusters 
(following fortis accented plosive = [1])—price, proud, tree, try, 
cream, crow; expression, surprise, attract, extremely, decree 
(following fortis fricative, unaccented fortis plosive, or accented 
fortis plosive preceded by /s/ in the same syllable = somewhat 
devoiced [1])—fry, afraid, throw, thrive, shrink, shrug; apron, 
nitrate, buckram, cockroach’; sprint, sprat, street, strain, scream, 
scrape 
(following lenis consonant = [1]—fricative after /d/)—brief, 
bright, dress, dry, grey, grow; umbrella, address, agree, hungry; 
comrade, sovereign, general, miserable 

words containing more than one /r/—brewery, library, arrowroot, 
retrograde, rarer, treasury, gregarious, procrastinate 

Compare—/r/, /\/—raft, laughed; rush, lush; red, led; right, light; 
pirate, pilot; sherry, Shelley; two rocks, two locks; crash, clash; 
pray, play; fry, fly; grew, glue; bright, blight 


(2) Description 

(a) Articulatory and distributional features.—The most common 
allophone of RP /r/ is a voiced post-alveolar frictionless continuant (or 
approximant) [1]. The soft palate being raised and the nasal resonator 
shut off, the tip of the tongue is held in a position near to, but not 
touching, the rear part of the upper teeth ridge; the back rims of the 
tongue are touching the upper molars; the central part of the tongue is 
lowered, with a general contraction of the tongue, so that the effect of 
the tongue position is one of hollowing and slight retroflexion of the 
tip. The air-stream is thus allowed to escape freely, without friction, 


LL 


Uf 


d 





FIG. 43.—/r/ = [a]. 


l In the case of /tr/, [r] is likely to be completely voiceless unless a junctural boundary 
occurs between /t/ and /r/. 
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over the centre part of the tongue. The lip position is determined largely 
by that of the following vowel, e.g. reach with neutral to spread lips, 
root with rounded lips. This allophone of the RP phoneme 1s, there- 
fore, phonetically vowel-like, but, having a non-central situation in the 
syllable, it functions as a consonant. 

RP /r/ occurs only before a vowel, the above description being 
applicable to the realizations: initially before a vowel; following a lenis 
consonant (except /d/), either in a syllable initial cluster or at word or 
syllable boundaries; word final /r/ linking with an initial vowel in the 
following word. (See §11.3.7.) This limited distribution applies also to 
other non-rhotic accents. (See §6.3.2(4)(d).) 

When /d/ precedes /r/,' the allophone of /r/ is fricative, the /d/ 
closure being post-alveolar and its release slow enough to produce 
friction, e.g. in drive, tawdry and, in rapid speech, at syllable or word 
boundaries, e.g. headrest, bedroom, wide road. 

A completely devoiced fricative [1] may be heard following accented 
/p, t, k/, e.g. price, try, cream, oppress, attract, across.! A partially 
devoiced variety of /r/ occurs: when /r/ follows an unaccented fortis 
plosive initial in a syllable and, in rapid speech, at syllable or word 
boundaries, e.g. upright, apron, paltry,' nitrate, beetroot, cockroach, 
acrobat; in the syllable initial sequences /spr-, str-, skr-/, e.g. spring, 
string, scream; after other accented fortis consonants, e.g. fry, thrive, 
shrink. Slight devoicing may also occur, in rapid speech, following 
these latter fortis fricatives, when they are unaccented and final in the 
preceding syllable or word, e.g. belfry, saffron, necessary /‘nesasri/, 
surfriding, birthright, horse-race, mushroom. 

(b) Acoustic features.*-—In addition to a low frequency (voice) 
component, the voiced frictionless continuant type of /r/ has a three- 
formant structure comparable with that of vowels, with a steady state 
which appears to have a necessary duration of 50—60 msecs. Typical 
formant frequencies are: Fl, between 120 and 600 cps, the lower the 
frequency, the greater the perceived impression of lip-rounding; F2, 
generally higher than that for /1/, showing a rising transition from a 
steady state of from 840 to 1,560 cps before [1, e, e], 840 to 1,200 cps 
before [9], and 600 to 1,200 cps before vowels of the [o, u] type, strong 
labialization being again associated with the lower frequencies; F3, 
which provides an essential cue to recognition, with a rising transition 
froma steady state near to that of F2, 1.e. 840— 1,920 before [i, e, e] and 
not higher than 1,680 cps before [9, o, u]. The duration of the transition 
of F2 and F3 must be greater than 50 msecs (below which the [r] is heard 
as a flap) and may still be heard as [1] with a duration of up to 300 


l| /tr/ and /dr/ sequences may bc treatcd phonetically as affricatcs when close-knit (see 
§8.3.2). 
¿See O’Connor, et al., op. cit. 
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msecs; Fl transitions, on the other hand, may be very short (even as 
short as the 10 msecs characteristic of /1/) without losing the [1] value. 

(3) Variants.—There are more phonetic variants of the /r/ phoneme 
than of any other English consonant. Within RP, the frictionless con- 
tinuant variety [1] may be replaced by an alveolar tap [r] in intervocalic 
positions, e.g. very, sorry, marry, Mary, for ever, following /8, 0/, e.g. 
three, forthright, with respect, and also, with some speakers, after 
other consonants, especially /b, g/, e.g. bright, grow. In the case of 
intervocalic [r], a single tap is made by the tip of the tongue on the 
alveolar ridge, the side rims usually making a light contact with the 
upper molars. The articulation of [r] differs from that of /d/ in that the 
contact for [r] is of shorter duration and less complete than that of /d/, 
the central hollowing of the tongue typical of [r] being retained; carry 
with [r] and caddy with /d/ are, therefore, phonetically distinct. In the 
case of [r] following /8, 0/, a tap is made by the tongue-tip on the teeth 
ridge as the tongue is withdrawn from its dental position; again, the 
contact is of very brief duration and some tongue hollowing is retained. 

In RP, too, the degree of /abialization varies considerably. Although 
for perhaps the majority of RP speakers the lip position of /r/ is deter- 
mined by that of the following vowel, some speakers labialize /r/ what- 
ever the following vowel. In some extreme cases, lip-rounding is accom- 
panied by no articulation of the forward part of the tongue, so that /r/ 
is replaced by /w/ and homophones of the type wed, red, are produced. 
Such a pronunciation, once a fashionable affectation (e.g. in the late 
eighteenth and early nineteenth centuries) is now heard only amongst 
children who have not yet acquired [1], or as a defective substitution for 
[1]. (The labio-dental frictionless continuant [v] is also commonly 
substituted for [1] in defective speech.) 

A lingual trill (or roll) [r] may also be heard amongst RP speakers, 
but usually only in highly stylized speech, e.g. in declamatory verse- 
speaking. This trill, i.e. a rapid succession of taps by the tip of the 
tongue on the alveolar ridge, is often considered typical of some 
Scottish types of English, though a single tap is more common in the 
colloquial forms of both Scottish English and some Northern speech. 

A uvular articulation, either a trill [R] or a fricative [g], may be heard 
in the extreme north-east of England and also among some Scottish 
speakers and as a defective substitution for [4]. 

Finally, the degree of retroflexion of the tongue for [1] may be greater 
than in RP—[{], e.g. in the speech of the south-west of England and 
some American English. The retroflexion of the tongue may anticipate 
the consonant and colour the vowel articulation; such ‘r-coloured’ 
vowels may occur in this type of speech in such words as bird, farm, 
lord. 

/r/ differs in various types of English not only in its phonetic realiza- 
tion but also in its distribution. Most regional forms of speech, remote 
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from the London area or unaffected by the social influence of RP, 
retain the earlier post-vocalic (both pre-consonantal and pre-pausal) 
usage of /r/, distinguishing between such RP homophones as pour, 
paw; court, caught. (Some older RP speakers still retain a vocalic 
element representing the former /r/, keeping pour /poa/ distinct from 
paw /p3:/.) The post-vocalic [r] used will correspond to the types 
given above according to the region. A retroflexed continuant, of some- 
what greater duration than the pre- or post-vocalic form, may 
also have syllabic function, e.g. in water, father, ladder, paper, etc. 
For the use of word-final, post-vocalic /r/ as a linking form in RP, 
see §11.3.7. 

A sequence of unaccented /r/s, separated by /9/, is frequently 
reduced in rapid speech by the reduction of /ra/ to syllabic /r/ or by its 
total elision (see §9.6). 

(4) Chief sources.—PresE /r/ derives from OE [r, rr] (rise, rope, 
green, nearer)—with later loss in post-vocalic positions (horse, four); 
OE [hr], [xr] or [r] (ring, roof)—[h] or the voiceless nature of [r] being 
lost by early ME; OE [wr] (wrap, write, wrest, wretch)—[w] being 
commonly omitted in the sixteenth century and generally by the late 
seventeenth, producing homophones with rap, rite, rest, retch; from 
OF [r] (rule, remain, fruit, very)—with later loss in post-vocalic 
positions (arm, poor). It is likely that the quality of /r/ in OE and ME 
was that of a linguo-alveolar trill or tap as described in (3) above. Its 
vibratory nature Is described by writers of the sixteenth and seventeenth 
centuries, but /r/ had for some time exerted an influence on the 
preceding vowel and an [a] resonance is identified in the sixteenth 
century. By the seventeenth century it is probable that the trill or tap 
was used only before vowels and that the approximant [1] occurred, 
with or without friction, finally and before consonants. The change 
from a fricative to a non-fricative [1], then to [ə], and finally to dis- 
appearance or merging with a preceding vowel in post-vocalic positions 
is a natural sequence. Its loss in post-vocalic positions in educated 
speech of the south-east of England is likely to have taken place by the 
end of the eighteenth century—considerably earlier in unaccented 
syllables and generally in popular speech. As has been stated above, 
total loss of earlier post-vocalic /r/ is restricted socially and regionally 
even to-day. 

(5) Advice to foreign learners.—RP /r/ has a quality which is rarely 
encountered in other languages, the usual frictionless continuant 
variety having much in common with a vowel. Any strongly rolled [r] 
sound, whether lingual or uvular, is not acceptable in RP, although it is 
not a question of loss of intelligibility through phonemic confusion. A 
weak lingual trill is the least objectionable substitution for RP [1]; but a 
uvular sound, trill or fricative, as so often used by French or German 
learners, is generally taken as a characteristic of a marked foreign 
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accent, despite the fact that a uvular sound is not unknown in English 
regional speech. 

A foreign learner should, therefore, try to abandon his own preju- 
dices as to what an [r] sound should be and approach the RP [1] as if it 
were a vowel. The correct retraction of the tongue may be achieved by 
placing the tip on the rear part of the upper alveolar ridge, then slightly 
lowering the tip, at the same time keeping the side rims in contact with 
the upper molars. Once the feeling of slight curling-back of the tip and 
hollowing of the centre of the tongue has been achieved, the student 
should hold this position and pronounce the vowel /3:/ of bird. This 
sound, with exaggerated length and weak breath effort, may then be 
used initially in a word such as red. It is important that [1] should be 
made unusually long in this position until the tongue articulation is 
established. 

Alternatively, practice may start from /3/, in the articulation of 
which the tongue has a position somewhat similar to that of [1], 
although the sound is fricative, the narrowing between tongue and roof 
of the mouth made too far forward, and the tongue hollowing and 
lateral contraction missing. From the /3/ position, the tongue should 
be retracted, hollowed, and slightly lowered, so that the friction is lost. 
With both methods, it is often helpful to hold the jaws widely separated 
and the lips somewhat rounded. It is particularly important that those 
whose own r-sound is of the uvular type should not make a double 
articulation—uvular and post-alveolar at the same time. The post- 
alveolar affricates /tr, dr/ may also be related to /tf, d3/, applying the 
same principles of retraction as just described (see also §8.3.2). 

Examples for practice should be chosen according to the degree of 
difficulty and the phonetic nature of the /r/ allophone used. Thus, the 
fricative variety of the second element of the affricates /tr, dr/ may be 
the first to practise (to establish the post-alveolar position), but the 
sequence /str-/ will give greater difficulty; intervocalic [1] usually 
presents relatively little difficulty, especially as a one-tap [r] is always 
permissible; the frictionless continuant, characteristic of initial posi- 
tions, may be the most troublesome articulation of all. 

Learners should not be misled by the spelling into pronouncing post- 
vocalic letter r. In words such as car, arm, horse, hurt, etc., the r-letter 
may be taken as a sign indicating length of the preceding vowel, and in 
fear, there, tour, etc., as a sign of the [a] element of the diphthong. 
Nevertheless, in connected speech, the final linking /r/ form should 
normally be used; no special attempt to use an ‘intrusive’ /r/ need be 
made. (See examples for practice in (1) above.) 

As in the case of /1/, it should be remembered that in the sequences 
/pr-, br-, tr-, dr-, kr-, gr-/ the oppositions between fortis and lenis 
plosives are indicated mainly by the degree of voicing in the following 
/r/. Thus, /pr-, tr-, kr-/ should have [1], especially when accented, if 
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they are not to be confused with /br-, dr-, gr-/; cf. such pairs as pray, 
bray; try, dry; crow, grow. 

Some learners, such as the Japanese and other Asians, whose own 
language does not have an opposition of the /l/-/r/ type, should 
practise words distinguishing /l/ from /r/ (see (1) above). 


8.9 Semi-vowels 


(1) Articulatory and distributional features. —A semi-vowel is a rapid 
vocalic glide on to a syllabic sound of greater steady duration. In 
English the semi-vowels /j/ and /w/ glide from positions of approxi- 
mately /1:/ (with spread or neutral lips) and /u:/ (with rounded lips) 
respectively, e.g. in year, west, Oswald [ vzwtd], spaniel [ ‘speenjt]. The 
actual point at which the essential vocalic glide begins depends on the 
nature of the following sound, e.g. the glide of /j/ to /1:/ in yeast has a 
closer beginning than that of /j/ to /p/ in yacht, and the starting point 
of /w/ before /u:/ in woo is closer than that before /p/ in what. When 
/3/ is followed by a back close vowel as in you, or /w/ by a front close 
vowel as in we, the starting points need not be as close as in yeast and 
woo, since in the first cases the glide is essentially of a front to back (or 
vice versa) direction, rather than a movement of close to more open, as 
in the latter cases. In English, however, it is never necessary for the 
starting point of /3/ + /1:/ or /w/ + /u:/ to be so close that it falls 
within the fricative region beyond the vowel area, since English /1:/ and 
/u:/ are both sufficiently relaxed for a perceptible non-fricative glide to 
be made from a closer position within the vowel area. 

Despite the fact that semi-vowels are, in phonetic terms, generally 
vocalic, they are treated within the consonant class, mainly because 
their function is consonantal rather than vowel-like, i.e. they have a 
marginal rather than a central situation in the syllable. /j/ and /w/ 
occur initially or in an initial cluster preceding a syllabic sound (see §7.2 
for the treatment of final [1] and [u] elements of English diphthongs as 
an integral part of the diphthongal glide rather than as separable— 
consonantal—/j/ and /w/). Their consonantal function is emphasized 
by the fact that the articles have their preconsonantal form when fol- 
lowed by /j/ and /w/, 1.e. the yard, a yacht, the west, a wasp, with /09/ 
or /o/ rather than with /01/ or /an/. 

In addition, the allophones of /j/ and /w/, when following a fortis 
consonant, are voiceless and fricative, as in cue, quick [kju: ], [kwik], 
1.e. they fall within a phonetic E of a consonant. 

(2) Acoustic features.'—Since /j/ and /w/ are vocalic glides (except 
in the case of the fricative allophones mentioned above), they may be 


1See O’Connor, et al., op. cit. 
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expected to have acoustic features similar to those of vowels, i.e. a 
characteristic two or three formant structure similar to that of /i:/ or 
/u:/. In fact, as for vowels, two formants are sufficient for good recog- 
nition. Compared with /r, 1/, the steady state of the semi-vowels is very 
short, e.g. of the order of 30 msecs. F1 starting point of the glide is that 
of /1:/ or /u:/, i.e. about 240 cps; F2 has a starting point within the 
range 2,280—3,600 cps for /j/, depending on the following vowel, and 
within the range 360—840 cps for /w/, depending on the following 
vowel. The transition duration of F2 is of the order of 50—100 msecs for 
both /}/ and /w/, with that of F1 of the same or shorter duration. 


8.9.1 Unrounded Palatal Semi-vowel 
ae 
(1) Examples 


(spelt y, i, e.g. ‘yes, spaniel’; also /ju /, spelt u, ew, eu, eau, ul, 
e.g. ‘muse, new, feud, beauty, suit’) 

word initial = [3] —yield, yes, yard, yacht, yawn, union, young, 
yearn, yokel, year, Europe 

following accented /p, t, k, h/ (only before /u:, ua/) = [¢]—pew, 
tune, queue, cure, pure, huge; accuse, secure, peculiar, attuned 

following /sp, st, sk/, fortis fricatives or unaccented /p, t, k/ = 
slightly devoiced [j] —spurious, stew, askew; enthusiasm, pursue, 
refuse; opulent, spatula, oculist; help you, quick yield 

following lenis consonant = [j]—beauty, duty, music, new, value, 
view; abuse, endure, argue, manure, onion, failure, familiar, 
residue, senior, behaviour 


(2) Description.—The vocalic allophones of RP /}/ are articulated 
by the tongue assuming the position for a front half-close to close vowel 
(depending on the degree of openness of the following sound) and 
moving away immediately to the position of the following sound; the 
lips are generally neutral or spread, but many anticipate the lip- 
rounding of the following vowel in such cases as you, yawn, etc. When 
/}/ follows a fortis consonant, devoicing takes place; especially when 
/}/ follows accented /p, t, k, h/, the devoicing is complete, with the 
result that a fortis voiceless palatal fricative [¢] 1s produced. (In these 
latter cases, it is the friction rather than the glide which identifies the 
phoneme.) 

When /}/ is the final element of accented clusters, only /u:/, /ua/ or 
sometimes /9:/ may follow /j/ (pew, cure); in unaccented clusters, /j/ 
may be followed by /u:, u, Ua/ or /a/ (argue, opulent, tenure, senior). 
The sequence /h/ + /j/ as in hue /hju:/ [h¢u:] may coalesce into [ç] 
({cu:]). Such a realization entails oppositions between /}/, /h/, and [ç], 
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FIG. 44.—/]/. 


raising the possibility of phonemic status for [¢]—you, who, hue. The 
number of words offering the sequence /h/ + /j/ > [ce] is, however, 
restricted (Hugh, hew, hue, human, humour, etc.), and alternative 
pronunciations with /h/ + /j/ or /h/ + [e] (on the pattern of /p, t, k/ 
+ /j/) are possible. [c] is, therefore, more conveniently treated as a 
realization of /h/ + /j/. 

(3) Variants.—In many cases of RP /j/ + /u:/, an alternative pro- 
nunciation without /j/ exists! (and is extended further in American 
English). Earlier /ju:/ or /ru/ sequences (see §7.9.9) have regularly been 
reduced to /u:/ in PresE after /tf, d3, r/ and /I/ preceded by a con- 
sonant; /ju:/ is retained after plosives, nasals, /f/, /v/, and /h/ (pew, 
beauty, queue, argue, tune, dune, few, view, nephew, huge), and when 
/\/ is preceded by an accented vowel (value, curlew); but in other cases, 
more variation is possible, both /u:/ and /ju:/ being heard, e.g. in 
absolute(ly), lute, salute, revolution, enthusiasm, pursuit, assume, suit, 
suet, Suitable, superstition, supermarket, consume, presume, etc, 
though /u:/ grows increasingly common in such words being the more 
common after /l/ and /s/ in a stressed syllable whilst /ju:/ remains 
predominant after /0, z/. 

Unaccented sequences of /tj, dj, sj, zj/ coalesced in an earlier state of 
the language into /tf, d3, f, 3/ (see §§8.3.1, 8.4.5). In some cases, e.g. 
statue, residue, issue, seizure, Christian, immediate, educate, grati- 
tude, usual, visual, Jesuit, both forms may now be heard, the pronun- 
ciation with /tj, dj, sj, zj/ being characteristic of careful speech; on the 
other hand, the occasionally heard coalesced forms with /f/ in assume 
and /3/ in presume are regarded as old-fashioned. Such coalescences 


! See Daniel Jones, Outline of English Phonetics, $8.17. 
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also occur in rapid, familiar, speech, as unaccented word boundaries, 
e.g. in not yet, would you, this year, sees you (§11.3.4). 

In unaccented syllables there is often variation between /ja/ and 
/1-2/, e.g.in immediate, India, audience, tedious, idiot, hideous. In the 
cases involving a preceding alveolar plosive where the /ja/ form may be 
regarded as the primary one, this form may occasionally be further 
reduced to /tf, d3/ in rapid speech, e.g. /1mi:djot, 1'mi:dzot, 
'mi:diot/. In such cases as Romania, Bohemia, Australia, morphia, 
/19/ tends to be retained in careful speech, as well as in those suffixes 
where /2/ has a separable morphemic value, e.g. in easier, heavier. 

A [j] glide is sometimes heard at a point of front vowel hiatus, where 
the first vowel is /i:/ or a diphthong ending in [1] (especially /a1, 91/), 
e.g. seeing, saying, sighing, fire, enjoy it [sijm, 'sejin, ‘sajin, fajo, 
in'd3oj it], as well as in the regional realizations of /1, e9/ as /Ijo, 
eja/ in such words as dear, there [dija, deja]. Such a perceptible [J] 
results from the relatively close quality given to /i:/ and [1] and the 
rapid glide to the following, more open, vowel; vowel sequences con- 
taining /i:/ or [1] are, however, generally so relaxed that such a glide is 
not heard (see also 87.12). 

(4) Chief sources. —PresE /j/ derives from OE [j] (itself deriving 
from a palatalized form of [g]) (yard, year, yield, yoke, young). PresE 
/\/ (+ /u:/) also derives from ME [liu, eu] (see §7.9.9) and from loss of 
syllabic value of [1] before another vowel in unaccented situations, e.g. 
in such words of French origin as opinion, familiar, onion, William, 
pavilion, where the ¿-spelling had at an earlier stage represented the 
palatal nature of French /p, A/. 

(5) Advice to foreign learners.—RP /j/ presents little difficulty, 
provided that the starting point of the glide is not so close as to produce 
friction in those situations where /j/ should be purely vocalic; such 
incorrect friction is most likely to be used before relatively close front 
vowels, e.g. in yeast, yes. Spanish learners, in particular, should avoid 
using a palatal plosive [3] when /j/ is strongly accented, e.g. in yes, 
young [jes, jan]. /3/ should be correctly devoiced after accented /p, t, 
k/, e.g. in pew, tune, queue. 


8.9.2 Labial-velar Semi-vowel 


/ 


/W/ 
(1) Examples 


(spelt w, wh; or u after q, g—west, which, quick, language; note 
“one, once, choir, suite? with /w/) 

word initial = [w]—weed, wet, wag, wasp, wood, womb, one, 
word, wave, woke, wire, weird, wear 


216 An Introduction to the Pronunciation of English 


following accented /t, k/ = [w]—twig, twelve, twin, twice, queen, 
quell, quick, quite, quaint, acquaint 

following /sk/, accented fortis fricative or unaccented /p, t, k/ = 
slightly devoiced [w]—square, squash, squirrel; thwart, swim, 
swear, swoon; upward, outward, equal; pump water, that word, 
take one 

intervocalic, or following lenis consonant = [w]—away, aware, 
inward, always, language; dwindle, dwarf, Gwen 

possible /hw/ or /m/—wheat, whether, which, what, white 

possible oppositions /w/, /hw, m/—witch, which; weather, 
whether; wine, whine; Wales, whales; wear, where 

Compare /w/, /v/—west, vest; wine, vine; worse, verse; wall, veil; 

welr, veer 





Fic. 45.—/w?/. 


(2) Description.—The vocalic allophones of RP /w/ are articulated 
by the tongue assuming the position for a back half-close to close vowel 
(depending upon the degree of openness of the following sound) and 
moving away immediately to the position of the following sound; the 
lips are rounded! (more closely when followed by /u:, u/ or /3:/ than 
when preceding a more open or front vowel—cf. woo, wood, war, with 
what, west, we; in those cases where /w/ precedes /u:/, the lip- 
rounding for /w/ is closer and more energetic than that associated with 
/u:/, permitting a distinction between such a pair as ooze, woos). The 
soft palate is raised and the vocal folds vibrate; but when /w/ follows 
a fortis consonant, some devoicing takes place: when /w/ follows 


! This is an example of double articulation, the approximation of the articulators at the 
bilabial and velar places of articulation constituting two strictures of equal rank. 
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accented /t, k/, the devoicing is complete = [w] or [m] (a fortis voice- 
less labial-velar fricative—the friction being bilabial). In this latter 
case, it is the bilabial friction rather than the glide which identifies the 
phoneme; such words as swoop, swoon, are distinguished from soup, 
soon, not only by the stronger lip action associated with /w/ but also by 
its partially devoiced friction. 

Consonants preceding /w/, especially initially in an accented 
syllable, will be lip-rounded in anticipation of /w/, e.g. in twist, queen, 
swing, language, conquest, and to a lesser extent at syllable or word 
boundaries, e.g. in onward, bindweed, front wheel, this one. 

(3) Variants.—The main variant, both in RP and in other types of 
British English, concerns the pronunciation of the spelling form wh. 
Amongst careful conservative RP speakers and regularly in several 
regional types of speech, e.g. in Scottish English, words such as when 
are pronounced with /hw/, or, more usually, the fortis voiceless labial- 
velar fricative [m]. In such speech, which contains oppositions of the 
kind wine, whine, shown in (1) above, /M/ has phonemic status. 
Among RP speakers, however—especially males—the use of /m/ as a 
phoneme has declined rapidly (though it is often taught as the correct 
form in verse-speaking). Even if /m/ does occur distinctively in any 
idiolect, it may nevertheless be interpreted phonemically as /h/ + /w/ 
(cf. the treatment of [c] in $8.9.1(2)). However, the fact that the stock 
of words in which /M/ may be opposed to /w/ (e.g. in whale, what, 
wheel, when, where, whet, which, whig, whin, whine, whirr, whist, 
whit, why) is greater than those in which [ç] is opposed to /j/, provides 
some argument in favour of a monophonemic rather than a biphonemic 
solution for /mĪm/. 

A [w] glide is sometimes heard at a point of vowel hiatus where the 
first vowel is /u:/ or a diphthong ending in [u] (especially /au/), e.g. 
doing, following, our, allow it [duwin, folowin, awa, 2 law ıt], as well 
as in the regional realizations of /9a/ (= RP /2:/) as [owa] in such 
words as door [dawa]. Such a perceptible [w] results from the relatively 
close quality given to /u:/ or [u] or [9] and the rapid glide to the follow- 
ing vowel; vowel sequences containing /u:/, [u] or [9] are, however, 
usually sufficiently relaxed for such a glide not to be heard (see also 
§7.9.9). 

Some RP speakers omit /w/ in the context /k-9:/, e.g. in quart 
(= caught) and quarter. 

(4) Chief sources. —PresE /w/ derives from OE [w] (way, wolf, 
wash, widow, twin, dwarf); from OF [w]—a northern French form, 
where Mod French has /g/ (wage, ward, warrant, war); from an earlier 
French [u] preceded by a velar plosive and followed by another vowel 
(squire, squirrel, squadron). In many cases an earlier [w] has been lost 
(so, such, thong, two, sword, answer) during the ME or eModE 
periods, and in the unaccented terminations of such place-names as 
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Chiswick, Dulwich, Greenwich, Woolwich; [w] in the cluster [wr] 
(write, wreck, wrist) was finally lost in the late seventeenth century. 

PresE /m/ derives from OE [hw], [xw] or [w] (spelt Awin OE) (whale, 
wheel, where, when, which, whistle). [hw] or [m] finally merges with 
/w/ in educated southern speech in the late eighteenth century, 
although still deplored by normative elocutionists; /w/ occurs for [hw] 
much earlier—probably in ME—however, in popular speech. The 
reduction of [hw] to /w/ is parallel to that of [hr, hn, hl] to /r, n, 1/; the 
relatively smaller number of French words entering the language with 
/w/ (compared with the numerous examples with initial /r, n, 1/) may 
be a reason for the longer persistence of a /hw/-/w/ opposition, the 
imbalance between the voiceless and voiced varieties being not so 
marked as in the case of /hr, hn, hl/ v. /r, n, I/: 

In the case of who, whom, whose, it is the [w] element which has been 
lost (probably by the eModE period), due to a merging of the [w] with 
the following, similar, /u:/. The addition of initial /w/ in one, once 
(cf. alone), occurred from late ME onwards in popular speech, existed 
alongside [o:n, o:ns] forms in eModE, and was finally adopted as 
regular by the end of the seventeenth century. 

(5) Advice to foreign learners.—/m/ being of restricted usage in a 
relatively small number of words, there is no need for the foreign 
learner of RP to adopt this sound for wh- words. It is, however, 
important that the vocalic allophone of /w/ should not be replaced by a 
consonantal sound, i.e. either [B], a voiced bilabial fricative, or [v], a 
voiced labio-dental fricative, in which the lower lip articulates with the 
upper teeth, or [v], alabio-dental approximant, in which there is againa 
loose approximation (without friction) between the lower lip and the 
upper teeth. All such substitutions will be interpreted by the English ear 
as /v/. The learner should protrude and round his lips, ensuring that 
the teeth play no part in the articulation; if necessary, in practice, an 
energetically rounded full [u:] vowel should be used, e.g. wine being 
pronounced as [u:aln], and a clear distinction being made between this 
word and vine (see examples in (1) above). The same protruded and 
rounded lip action (and absence of lower teeth contact) applies to the 
voiceless allophone [w], as in quite, twin, etc. As in the case of the 
voiceless allophones of /l, r, j/, it is important that /w/ should be 
devoiced especially after accented /t, k/, despite the fact that there are 
no exact pairs depending on the opposition [dw, gw]-[tw, kw], cf. 
dwell, twelve; distinguish, relinquish; dwindle, twin; Gwen, quench. 


8.10 The Frequency of Occurrence of RP Consonants! 


As a class, the alveolar phonemes emerge as those which occur most 
frequently in English, this being a generalization which appears to be 


! See D. B. Fry, op. cit. In this investigation /tr/ and /dr/ were treated as a sequence of 
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applicable to many languages. The following order of frequency of 
occurrence has been established by D. B. Fry!:— 


% % 
/n/—7:58 /b/—1-97 
/t/—6:42 /f£/—1-79 
/d/—S-14 /p/—1-78 
/s/—4-8] /h/—1-46 
/1/—3 -66 /y/—1-15 
/0/—3:56 /g/—1-05 
/t/—3°5] /{/—0:96 

/m/—3-22 /1/—0:88 
/k/— 3-09 /d3/—0-60 
/¡w/—2:81 /t$/—0:41 
/z/—2-46 /0/—0:37 
/v/—2:00 /3/—0:10 


Total all consonants: 60-78% 


It is interesting to note the discrepancies of occurrence between the 
fortis and lenis members of homorganic pairs of phonemes: thus, /s, 0, 
k/ occur considerably more frequently than their counterparts. This 
general discrepancy of frequency of occurrence is, however, less 
important, as far as oppositional significance is concerned, than the 
frequency of opposition in any particular position in a context. More- 
over, in any general frequency count such as this, the order obtained 
will reflect the occurrence of such ‘common’ words as the, that, which 
etc., giving preponderance to /0, w/, for example, as against /0,j]/. As 
is to be expected from its historical origins and its restricted contextual 
distribution, /3/ occupies the lowest position. 


/t/ or /d/ and /r/; the frequency of occurrence of these three phonemes would be slightly 
lower if /tr/ and /dr/ were taken as composite (monophonemic) entities. 

l In the original article, an error arose in the figures for /t/, /d/, and /r/, resulting in a 
total discrepancy of 1-19%. These figures have been corrected by Mr. G. Perren 
(formerly of British Council, London), and the total discrepancy reduced to 0-01 %. The 
listed quoted above includes the revised percentages for /t/, /d/, and /r/. 





Part Ill 
The Word and Connected Speech 


Introduction 


We have seen that speech, a manifestation of language, may most 
readily be investigated at the physiological and physical levels. Such 
investigations show, however, that it is by no means easy to analyse the 
data obtained for any speech event into discrete, successive units within 
either the articulatory or the acoustic patterns. Speech must, therefore, 
be considered from the phonetic point of view as an ever-changing con- 
tinuum of qualities, quantities, pitch, and intensities. If, for practical 
purposes, e.g. in a phonetic/phonemic transcription of the spoken 
language, we treat speech as a succession of articulatory or auditory 
separable units, it is largely because we Impose, upon the gross material 
of speech, entities which we have derived (consciously or uncon- 
sciously) from a knowledge of the linguistically significant oppositions 
Operating for any particular language-system, 1.e. the phonemic 
categories. In other words, a useful phonetic/phonemic account of 
speech describes those articulatory or acoustic features which are 
significant in distinguishing the phonemes (or the important allophones 
within a phoneme) of a language. It should not, however, be forgotten 
that such a linear presentation is an abstraction from the concrete 
material of speech rather than a statement of the gross phonetic 
material composing the continuum. In this respect, the sophisticated 
written form of English differs from the spoken manifestation of the 
language, for our writing explicitly represents a succession of discrete 
linguistic units—phonemes (nowadays imperfectly) and words. 

If, however, for convenience, our analysis is based on discrete 
phonemic units, it is necessary to take into account the way in which 
such units combine in speech—both in the word and in longer utter- 
ances; thus, the aim of the following sections is to interpret features of 
the continuum in terms of the combinatory influence of qualitative and 
quantitative units and of the pitch and intensity relationship of such 
units. 
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9 
The Word 


9.1 Accent 


The word, composed of one or more phonemes, has a separate 
linguistic identity, in that it is a commutable entity, higher than the 
phoneme, which may either constitute a complete utterance or may be 
substituted in a longer utterance for other words of its same class. It 
may be thought of as a shape or pattern, constituted from the quali- 
tative and quantitative elements of its phonemes, these phonemic 
elements being also capable of distinctive commutation, e.g. the /t/, 
/d/, and vowel length commutation in write, ride, or writer, rider. In 
addition, in polysyllabic words, the word shape has an identity deter- 
mined by the relationship of its parts. Thus, writer, rider, may be said to 
have a pattern consisting of a ‘strong’ syllable followed by a ‘weak’ 
syllable: ə .. This pattern obtains also for written /'ritn/; but, in the 
case of return /ri't3:n/ or again /a gen/, the pattern is. e. It may be 
said, therefore, that the identity of return, again, depends not only on 
the different qualitative and quantitative elements which distinguish 
these words from, for example, written, but also on the different 
patterns of the total word-forms which derive from the varying 
prominence of the parts. The syllable or syllables of a word which stand 
out from the remainder are said to be accented, to receive the accent. 

The accentual pattern of English words 1s fixed, in the sense that the 
main accent always falls on a particular syllable of any given word,! but 
free, in the sense that the main accent is not tied to any particular 
situation in the chain of syllables constituting a word, as it is in some 
languages, e.g. to the penultimate syllable in Polish, to the first in 
Czech, and to the last in French. Thus, the main accent falls regularly 
on the first syllable in such words as finish, answer, afterwards; on the 
second syllable in behind, result, together, impossible; on the third 
syllable in understand, education; or later in articulation, palataliza- 
tion, etc. 


l With certain exceptions, determined by the larger rhythmic pattern of the total 
context (see §11.2). 
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The accentual shape of a word, in terms of the degree of prominence 
associated with its parts, 1s a reality for both the speaker and the 
listener; but the speaker’s impression of the factors which produce such 
a pattern of varying prominences may differ from the actual auditory 
cues by which the listener perceives the prominence pattern. It 1s, 
therefore, necessary to examine the factors which in English are 
significant for the speaker and for the listener in producing the com- 
municated effect of prominence. 


9.2 Prominence 


Any or all of four factors—stress, pitch, quality, quantity —may render 
a syllable more prominent than its neighbours. Not all have equal 
validity for speaker and listener alike; they are not all equally efficient 
as markers of the accented syllable. 

(1) Stress.—A sound or syllable which is stressed is one upon which 
there is expended in the articulation relatively great breath effort and 
muscular energy; in voiced sounds, greater amplitude of vibration of 
the vocal folds, together with the reinforcing resonation of the supra- 
glottal cavities, results in physical terms in relatively great intensity of 
the sound or syllable, such intensity being perceived by the listener as 
greater loudness associated with the sound or syllable. Thus, the greater 
energy which the speaker feels is concentrated on the first syllable of 
answer may be manifested for the listener as greater loudness. The 
speaker may feel several degrees of articulatory energy in producing a 
polysyllabic word, e.g. in the word examination, the syllables may be 
articulated with the following descending order of energy /nel/, /zxe/, 
/19/, /mi/, /fn/. Nevertheless, the English speaker will normally 
distinguish readily only stressed and unstressed (strong and weak) 
syllables, to which will correspond the listener’s impression of loud and 
weak syllables. 

(2) Pitch change.—Stress/loudness is not, however, by itself an 
efficient device for signalling the situation of the accent. Thus, when 
they are said on a monotone and with unexaggerated quantities, it is 
difficult to distinguish by stress alone insult (n.), import (n.), billow, 
from insult (v.), import (v.), below, in which the different accentual 
pattern is not associated with marked qualitative changes. The accen- 
tual cue becomes clear, however, when stress is associated with pitch 
prominence; the verb and noun forms of insult, for example, are easily 
distinguished by pitch patterns, e.g. with a falling intonation, 








e, against 4 ., or, with arising intonation, . «y against » 


_— = 


Pitch changes will also manifest more than one stressed syllable 





in a word; thus, examination with , ° "a , or, ® ° p” 


or, within a phrase such as the first examination was over 
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E E , where the stressed syllables show a 





change of pitch in relation to the preceding syllables. Nevertheless, 
among stressed syllables, some are more likely than others to be 
associated with pitch change; thus, when examination is pronounced 


with the pitch pattern. . , Aa , , the syllable /zx/ (noted above 


as the syllable felt to be articulated with the second highest degree of 
energy) may have no pitch prominence in relation to its neighbours. 
Such accentual prominence as it has for the listener is, therefore, 
achieved to a large extent by stress alone; this type of accentuation may 
be called secondary, rhythmic, or non-tonic. The syllable /nei/, on the 
other hand, will always be associated with a pitch change when the word 
is said in isolation; this type of accentuation may be called primary, 
nuclear, or tonic. The pitch prominence of word primary accent may be 
lost in connected speech, e.g.:— 


. . 9 . O o o 0 . A . 


The examination was almost over 


but the syllable of primary accent is always that on which there is a 
potential change of pitch. Such a change of pitch direction may also be 
associated, but less typically, with those syllables which more charac- 
teristically have secondary accentuation, e.g. examination, said in an 
indignant, emphatic manner, might well have the pitch pattern 


. A , e ,,inwhich case /zæ/ and /nel/ are equal in both stress 
and pitch prominence. (Abnormal pitch prominence may often be 
given to a weakly stressed syllable, when, with a falling-rising pitch 
pattern, the fall may take place on the primarily accented syllable and 
the rise on a following, normally unstressed and unaccented, syllable, 





e.g. under e, J officer q , J.) 

The most common relationship of pitch and stress in word accentua- 
tion may be summarized as follows:— 

Primary accentuation—associated with a potential change of pitch 
direction. 

Secondary accentuation—not normally associated with a change of 
pitch direction, but may be 

(a) prominent, i.e. signalled by a potential change of pitch /evel, or 

(b) rhythmic, 1.e. having no pitch prominence, but with rhythmical 

stress, and often quality/quantity, prominence. 

Stress and pitch variations combined may, therefore, be said to con- 
stitute a complex which is most powerful in signalling the situation and 
degree of the accent in a word. 

(3) Sound quality.—The actual sequence of phonemes constituting a 
word also contributes to the listener’s impression of an accentual 
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pattern, i.e. to the relative prominence of sounds and syllables. In a 
sequence of phonemes (said on a monotone with equal stress and 
length), vowels will normally be more prominent than consonants; 
among the vowels, prominence increases as the vowel becomes more 
open; among the consonants, those which are vowel-like, e.g. /m, n, D, 
l, r/, stand out from the remainder, while fricatives have a higher 
prominence than plosives. Thus, if a nonsense word composed of 
English consonants and short vowels, such as [ilplelz], is presented to 
English listeners without pitch, length, or stress variations, 1t usually 
happens that [p] and [æ] are judged to be the points of greatest promi- 
nence. However, it should be noted that a different sound inserted ina 
repetitive sequence of similar sounds may tend to stand out from its 
neighbours, although its ‘inherent’ sonority is less than that of its 
neighbours, e.g. [1] in such a sequence as [eneneninen«], where the 
contrasting feature of [1] is not so much its quality as its difference from 
the environment. 

In addition to the prominence of sounds, due to their nature or the 
character of the context in which they occur, certain English phonemes 
are particularly associated with unaccented situations.' Thus RP /3/ 
does not normally occur in accented syllables; and /1, U/, though both 
may receive full accentual prominence, have a high frequency of occur- 
rence in unaccented syllables. The other English vowels may also occur 
in syllables which do not receive the primary accent (see §7.13), but they 
may be associated in the speaker’s and listener’s minds with some 
degree of secondary accent (especially /z, p/ and the long vowels and 
diphthongs) by virtue of their greater qualitative prominence (and also, 
when separated from a primary accent by one or more unaccented 
syllables, for reasons of rhythm)—compare, for example, the accentual 
prominence of the last syllables in never /'neva/ and nephew/ 
/'nevju:/, wrecker /'reka and record /'rek3:d/; or of the first syllables 
of contain /Kan'tem/ and canteen /keen'tizn/, aghast /a'ga:st/ and 
august (adj.) /d:'gast/. 

This association of vowel quality and prominence leads sometimes to 
variation on the speaker’s part both as to the qualities used and to the 
situation of the pitch prominence. Thus, especially in syllables 
preceding the primary accent, ‘strong’ and ‘weak’ vowels may occur 
according to the speaker’s feeling of strong or weak stress (a feeling of 
weak stress resulting in secondary accent or lack of accent), e.g. /u:/, 
/u/, or /3/ in the first syllable of July, /au/, [o], or /3/ in the first 
syllable of November, proceed, /e/ or /3/ in the first syllable of 
September. Alternatively, especially in disyllables each of which may 
contain a strong vowel, e.g. in contact (v.) /‘kontekt/, adult /‘zedalt/, 
there may be hesitancy as to the situation of the pitch prominence 


'For a full treatment, see G. F. Arnold, Stress in English Words, 1957. 
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denoting primary accent; if the prominence is placed on the Do 
syllable, there is likely to be weakening of the first vowel, 
/kən'tækt/, /ə'dalt/. It is to be noted that, despite relatively a 
stress, there is often no choice between strong and weak vowel, when 
the vowel occurs in the syllable immediately following the primary 
stress, €.g. phoneme, insect, syntax, placard, diphthong, record (n.), 
perfume, hubbub, expert, female, hero, convoy, etc., all retain a strong 
vowel in the final syllable, however weakly stressed by the speaker. 

It is also to be noted that in those types of English, e.g. Northern 
regional, where the first syllable of such words as obtain, contain, con- 
tinue, expect, has a strong vowel rather than the /9/ or /1/ used in RP, 
an impression of unusual accentuation on the first syllable is given to 
RP listeners. 

Quantity.—If the nonsense word [ilolelee] is presented to English 
listeners, with no pitch variation but with vowels of different length, 
those vowels which have the greatest length will be judged most promi- 
nent despite their quality. Thus, [1], of relatively weak prominence in 
quality alone, may stand out from the other vowels of the word if 
pronounced [1:]. 

The RP long vowels and diphthongs (particularly in their non- 
reduced forms) will always, therefore, be associated with prominence, 
especially by a listener, however weakly stressed by the speaker they 
may be, and despite the fact that a long vowel or diphthong in a rela- 
tively unaccented syllable does not have the same length that it would 
have in an accented syllable. This is the case of the vowels in the final 
(post-tonic) syllable in such words as phoneme /i:/, placard /a:/, 
foreword /3:/, railway /e1/, compound /av/, invoice /91/, etc.; for 
most speakers, in the pre-tonic syllable of idea /ai/, sarcastic /a:/; 
and, more particularly, when a rhythmic stress 1s associated with such 
vowels in a syllable remote from the tonic, e.g. finally in pedigree /1:/, 
photograph /a:/ (or /#/), telescope /av/; initially, in parliamentary 
/a:/, perpendicular /3:/, photographic /2U/. A vowel in an 
unaccented syllable may in fact be longer than that in an adjacent 
accented syllable, without depriving the latter of its prominence, e.g. 
/“1/ and /av/ in pillow, /'œ/ and /a1/ in ally, 'a/ and /19/ in frontier, 
EIC. 

(5) Conclusion.—It will be seen that any or all of these four factors 
may play a part in rendering a sound or syllable prominent. In speech, 
however, it is pitch variation which is the most commonly used and 
efficient cue of prominence for the listener. Pitch prominence is not, 
however, necessarily associated with the syllable having the highest 


pitch thus, in examination, said questioningly and querulously as 
SO e , the accented syllables achieve their prominence from 
their ds low pitch. 
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Length variation is also a strong contributory factor both as regards 
the association of vowel quantity with accentuation and also as a 
feature of prominence in its own right (note, in this respect, the use of 
length to replace pitch in the intoning of prayers in church, e.g. in ‘and 
the blessing of God be upon you’ with the lengthening of the short 
vowels of bless, God, upon /v/). Sound qualities also contribute to an 
impression of prominence, but mainly through the characteristic 
relationship of certain qualities with unaccented syllables and others 
with accented syllables. 

Stress, strictly defined in terms of energy of articulation for the 
speaker and of loudness for the listener, is the least effective means of 
conveying prominence. Indeed, the syllable uttered with the greatest 
intensity may not be the one with the greatest accent or prominence; 


thus, insult (n.), said in questioning and indignant fashion as e `, 


may have more energy expended on /salt/ than on /in/, but /In/ 
remains for the speaker and listener the accented syllable. It should be 
noted, however, that stress has a particular reality for the speaker which 
is not communicated directly to the listener. The speaker feels a 
rhythmic beat in uttering a word, which is associated with his mental 
(linguistic) usage of the word and extended to the variation of energy 
used in the articulation of the word. This subjective, rhythmic, pattern 
may evoke an equivalent response in the listener’s perception of the 
word, but rather through the listener’s linguistic recollection (in terms 
of the system which he shares with the speaker) than through the 
exchange of signals susceptible of acoustic analysis. It is in this sense 
that an accented syllable may be felt as stressed by the speaker and 
interpreted thus by the listener though it has no physical existence; for 
example, in the case given by Daniel Jones,! Thank you may be realized 
as /'kkju :/, where the stressed and accented syllable consists of silence. 


9.3 Word Accentual Patterns? 


Since certain syllables of a word are more prominent than the others for 
any of the reasons mentioned above, it may be said that a word has a 
characteristic accentual or rhythmic pattern for speaker and listener 
alike. This basic pattern, which is as much a part of a word’s identity as 
lts sound sequence, may, however, be somewhat modified by the 
general accentual pattern of the longer utterance in which it occurs (see 
$11.2). In the following selection of patterns for polysyllabic simple 
words (1.e. excluding words composed of separable root morphemes 


| Outline of English Phonetics, $909, footnote. 
¿See also R. Kingdon. The Groundwork of English Stress, Longman, 1958; and G. F. 
Arnold, op. cit. 
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or words, but including those containing an affix), the accentual 
elements are marked thus:— 


> — syllable receiving potential primary (tonic, nuclear) accent; 

e » o —Syllable receiving potential secondary (pitch prominent or 
rhythmic) accent; in pre-nuclear syllables, such secondary 
accent 1s frequently manifested by actual or potential pitch 
prominence,! in post-nuclear syllables, secondary accent 
may be manifested, when adjacent to tonic, merely by 
qualitative or quantitative prominence—marked as 0, or, 
when remote from tonic, together with a rhythmical beat— 
marked e. 

- —unaccented syllable (associated usually with the ‘weak’ 
vowels /3, I, U/ or with syllabic /m, n, 1/). 


Pattern Examples 
2 syllables 


e Y —unknown, Chinese, idea, antique, thirteen, canteen, cashier, 
chastise 

è> —alone, machine, arrive, behind, invent, reform 

o —profile, placard, female, invoice, programme, window 

. —over, under, husband, valley, rhythm, cotton, table 


Y g - 


Pattern Examples 
3 syllables 


e . y —understand, cigarette, magazine, entertain, personnel, 
seventeen, afternoon 








% . . —quantity, yesterday, innocence, bachelor, wanderer 
% . e —appetite, pedigree, photograph, cataract, telephone 
- Y . —important, encounter, excessive, relation, eleven 
e > —uncertain, re-entry, foreknowledge 
è% o —tobacco, tomato, embargo, prehensile (or e % o) 
Pattern Examples 
4 syllables 
E Sa ee remarkable, impossible, affirmative, establishment, 
photography 
e Y . + —rhinoceros, unfortunate, diocesan 


l Note that this type of syllable carries the main accent of a word manifesting ‘stress 
shift’. (See §11.2). 
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y 


5 syllables 


Pattern 


6 syllables 


Pattern 


7 or 8 syllables 


0.0000. 


—acetylene, acclimatize, enumerate 

—unimportant, insufficient, circulation, diplomatic, 
photographic 

—cowardliness, melancholy, caterpillar, criticism 

—telescoping, appetizing, educated, helicopter 

—counterattack, capitalize 

—aquamarine, photogravure 

P(O e 


. . ¥ )—misrepresent, misunderstand 


Exarnples 


—affiliation, consideration, apotheosis 
—antimacassar, circumlocution, interdependence 
—satisfactory, objectivity, aristocracy 
—rehabilitate, inexactitude, incapacitate 
—catholicism, administrative, empiricism 


Examples 


—inferiority, impossibility, convertibility 

—characteristically, variability, meteorological 

—uncooperative, ceremoniously, indistinguishable 

—palatalization, nationalization, autobiographic 
(or withe . o . Y.) 

—personification, identification, electrification 


Examples 
: —unilateralism 
o —unreliability 
> —enthusiastically (/-kal1/) 
. —industrialization 
. —impenetrability 


Y . —Internationalization 


The stress patterns of some of the derived forms listed here are 


predictable, as certain general rules apply with regard to suffixing. 
When certain suffixes are added to a word, the root retains its origi- 
nal primary accent; such stress-neutral suffixes include -ness and 
-ism. Thus po'lite, po'liteness; ‘cowardly, 'cowardliness, A'merican, 
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Americanism; ‘token, 'tokenism. Other suffixes such as -esque and 
-ette are stress-attracting (‘autostressed’), the accentual pattern of such 
words as pictu'resque and launde'rette manifesting stress on the suffix; 
whilst others such as -ity are ‘pre-stressed’, the primary accent of words 
such as ‘rapid, re liable being replaced in the derived forms ra'pidity and 
relia bility by accents on the syllable immediately preceding the suffix.' 
See §9.7. 

It will be seen from the selection of English word accentual patterns 
given above that there is a certain rhythmic tendency towards alterna- 
tion of accented and unaccented syllables. Such a regular rhythmic 
pattern is often, however, disturbed by the meaningful prominence 
given to a prefix, e.g. mis-, un-, in-, etc., or by the retention of a strong 
vowel quality or the presence of a long vowel or diphthong in what 
might rhythmically be expected to be an unaccented syllable. The 
conflict between considerations of rhythm and other factors giving an 
impression of accentual prominence results in numerous cases of 
pattern instability (see §9.4). 

Compound words (i.e. words composed of separable root 
morphemes, whether or not the compound is hyphenated in the 
spelling) normally contain a single primary (nuclear, tonic) accent: on 
one element of the compound, the other element or elements carrying 
secondary accent. The most common type of compound has the 
primary accent onthe first element. In the case of disyllabic compounds 
of this kind, the secondary prominence of the second element is usually 
mainly qualitative or quantitative (i.e. lacking the rhythmic beat which 
may fall on syllables remote from the nucleus): — 


Y o —air-raid, bandstand, backache, blackbird, blackmail, bird- 
cage, bookcase, bridesmaid, bulldog, cardboard, catmint, 
chain-gang, clothesbrush, coal-mine, crossword, duckpond, 
earthquake, flowerpot, footprint, gas-mask, greenfly, 
heartburn, highbrow, knee-cap, lifeboat, miail-bag, 
nightdress, offprint, route-march,  tail-coat, teapot, 
washstand, windscreen 


(In some cases of such compounds, the second element is unaccented 
and has the weak vowel /2/, e.g. ‘boatswain, coalman, chairman, foot- 
man, postman, saucepan’.) 

In three-syllable compounds, the pattern may be % o + (with 
qualitative or quantitative prominence on the second syllable) or 

Y . e(with rhythmic and qualitative/quantitative prominence on 
the third syllable), e.g.:— 


Y o .—air-worthy, bluebottle, dance-music, dog-collar, forerunner, 


! For a full account of affixes and stress, see Erik Fudge, English Word Stress, Allen 
and Unwin, 1984. 
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gold-digger, grasshopper, grandfather, handwriting, 
housekeeper, landlady, lawnmower, manslaughter, 
newspaper, postoffice, shopkeeper, steam-roller, step- 
ladder, stock-piling, take-over, tape-measure 

X . +. —booby-trap, borderline, bowling-green, butterscotch, 
buttonhole, chambermaid, chatterbox, chimney-sweep, 
copyright, drawing-pin, fire-brigade, gingerbread, honey- 
moon, ladybird, mortar-board, office-boy, waterspout, 
walting-room 


Other word patterns may be illustrated as follows: — 


X . e . —antechamber, booking-office, carpet-sweeper, dinner- 
jacket, engine-driver, knuckleduster, lighthouse- 
keeper, organ-grinder, power-station, season-ticket, 
tape-recorder 


Yo. . e —musical-box, catherine-wheel, labour-exchange 
. Y o . —dispatch-rider, machine-minder 

doo. . —fire-extinguisher 

do... .  —Ccabinet-maker 


In other types of compound the first element does not bear the 
primary (nuclear) accent, but has normally a secondary accent deriving 
from potential pitch and rhythmic prominence, e.g.:— 


e y —back-fire, beam-ends, black-lead, downstairs, first-class, free- 
wheel, full-grown, ground-floor,- henceforth, knee-deep, 
mincepie, north-west, short-term 

e y . —archbishop, blackcurrant, churchwarden, coldblooded, 

cross-question, downhearted, field-marshal, flat-footed, 
good-looking, half-volley, horse-chestnut, lawn-tennis, 
pig-headed, pot-bellied, safekeeping, shop-window, 
short-sighted, well-meaning 

o . Y —country-house, double-cross, ginger-beer, secondhand 

o . y . —broken-hearted, easy-going, indiarubber, mass-produc- 

tion, sergeant-major 

o Y . . —post-graduate, vice-chancellor 

o y . e .—hot-water-bottle, waste-paper-basket 


Compounds of this latter kind are sometimes said to have ‘double’ or 
‘level stress’, but 1t should be noted that, although both elements 
may show a nuclear pitch change, e.g. free-wheel e e ore el ,such 
a falling or rising pitch change is more usually associated with the 
second element SN or .J . The effect of the association of pitch 








prominence of some kind with both elements is, however, to make both 
elements of the compound stand out, with corresponding meaningful 


prominence. Compounds of the first kind, e.g. crossword e œo , with 


The Word 233 


most of the prominence and meaningful weight concentrated on the 
first element, are often referred to as carrying a “single stress’, which 
unifies rather than opposes the elements. 

It will be noticed from the examples listed that there is a tendency for 
noun + noun compounds to have a different stress pattern from 
adjective + noun compounds. Most commonly in noun + noun (A + 
B) compounds, it is the first element that takes the primary accent. Thus 
grapefruit ® o . However, if B is made of A, the second element:is 
more likely to be accented. Thus lemon squash. . y (squash made 
of lemons) but mineral water 4 . e «(not water made of minerals), 
this again examplifying the most usual pattern for noun + noun 
compounds. ! 


9.4 Word Pattern Instability 


The accentual patterns of words are liable to change. Considerable 
changes of this kind have taken place within the last three hundred 
years, in addition to the large-scale accentual shifts affecting French 
importations in ME. Thus, in the seventeenth century, and still in 
American English, a secondary accent with a strong vowel fell on the 
penultimate syllables of such words as necessary, adversary, 
momentary; the noun and verbal accentual opposition between, for 
example, abuse(n.)e .andabuse(v.). «has been lost, though in this 
pair of words a qualitative/quantitative distinction is maintained in the 
second syllable; we no longer place the primary accent on the second 
syllable of revenue, illustrate, confiscate, character, etc., or on the first 
syllable of humidity, convenient, prosperity, etc. 

Hesitancy and variation of pattern occurring at the present time are 
the result of rhythmic and analogical pressures, both of which entail in 
addition considerable changes of sound pattern in the word.? 

(1) Rhythmic changes.—In some words containing more than two 
syllables there appears to be a tendency to avoid a succession of weak 
syllables, especially if these have /3/ or /1/. Thus, in words of three 
syllables, there is variation between Y . .and. ẹ .patterns,e.g.° 
exquisite */‘ekskwizit/ or /iks'kwizit/, deficit */‘defisit/ or /difisit/, 
integral */'mtigral/ or /in'tegral/, mischievous */'mistfivas/ or 


1 A detailed account of the construction-type and stress is given in Fudge op. cit. 

2 These remarks apply mainly to RP and to the patterns of isolate words rather than 
those variants occurring in connected speech (see §11.2); they do not take into account 
patterns used in other speech forms, e.g. in Scottish English, enquiry /enkwir1/, realize 
/rialaiz/, advertisement / gedvar taizmant/ : 

3 Where there is an accepted ‘correct’ pattern, it is marked here with * in the trans- 
cription; the marks ‘and ¡indicate primary ( Y ) and secondary (e or o) accent respec- 
tively, but see $86.3.3 and 6.3.4 above. 
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/mis'tfizvas/ (vulg. /muis'tfi:vies/). inculcate */ kal keit/ or 
/1n'kal keit/; note, too, acumen /'ækjumen/ or /a kju:men/, sonorous 
/'sonaras/ or /sa no:ras/, precedence / presidns/ or /prisi:dns/, where 
è . . increasingly replaces the traditional . % . pattern, and the 
pattern instability of such words as importune */im'po:tju n/ or 
/ impo: 'tju:n/, premature /'prematjua/ or /, prema tjua/. 

Similarly, in words of four syllables, there is variation between the 
patterns- è> . +». .OF } =, e and -- %» <=... Or. > PA 
e.g. controversy */'kontravasi/ or /kantrovasi/, hospitable 
*/'hospitabl/ or /ha spitabl/, despicable */'despikabl/ or /dispikabl/, 
formidable */'fo:midabl/ or /famidabl/, capitalist */'keepitalist/ or 
/ka pitalist/, aristocrat */ geristo kreet/ or /a rista,kreet/. Note, too, 
variation between. Y . . and èe . X . in such words as centri- 
fugal */se (or /3/) ntrifjugl/ or /sentrifju:gl/, centripetal 
*/sentripitl/ or /sentri'pi:tl/, metallurgy */mi tæləd31/ or 
/\meta13:d31/ or /'meta,l3:d3i/. A relatively new word such as tele- 
vision now appears to have the pattern Y . . . predominantly, the 
variant e . Y . being less common. 

Longer words, too, often exhibit a tendency towards the alternation 
of accented and unaccented syllables with various rhythmic patterns, 
e.g. tuberculosis /,tju:bakju'lousis/ or tju,b3:kjulousis, articulatory 
/,a:tikjuleitari/, /a:(or /9/),tikju leitan/ or /,a:(or /2/)tikjulatri/, 
inexplicable */wm'eksplikabl/ or /,nik'splikabl/. 

(2) Analogical changes.—It sometimes happens that a word’s 
accentual pattern is influenced not only by rhythmic pressure but also 
by the accentual structure of a similar word of frequent occurrence. 
Thus, the analogy of the root forms apply /a plai/, prefer /prif3:/, 
compare /kam'peo/, is responsible for the realization of applicable, 
preferable, comparable, as /a'plikabl, pri f3:rabl, kam'peerabl/ instead 
of */'zplikabl, 'pref(o)rabl, komp(a)rabl/. Again, the existence of con- 
tribution, distribution / kontribju: fn, ,distri'bju: fn/ may account for 
the pronunciation /‘kontri,bju:t, distribju:t/ (contribute, distribute) 
instead of the more usual (‘correct’) */kan'tribju:t, dis tri,bju:t/, 
where the first syllable is totally weakened and the last retains only a 
certain qualitative prominence. The case of sucha word as dispute (n.), 
with the form /'dispju:t/ instead of the more usual /di'spju:t/, 
exemplifies an analogous form comparable with the accentual opposi- 
tions of verb and noun in disyllables (see following section), i.e. 
/‘dispju:t/ (n.) as against /di'spju:t/ (v.), cf. discount /'diskaunt/ 
(n.), /diskaunt/ (v.). Primary accent is also unstable in such 
compounds as shop-steward (è o or e Y), weather-forecast 
(d>. e. . Ore . Y 2.) 
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9.5 Distinctive Word Accentual Patterns 


The accentual pattern of a word establishes the contrastive relationship 
of its parts; it may also have a distinctive function in that it opposes 
words of comparable sound structure (and identical spelling). Such 
word oppositions (for the most part disyllables of French origin) may or 
may not involve phonemic changes of quality. 

(1) A relatively small number of words! exhibit oppositions of 
accentual pattern which are manifested mainly by a shift of pitch or 
stress prominence, with slight variations of quantity. In situations 
where the prime signal of prominence—pitch—is not operative, the 
oppositions are either totally lost or are inefficiently marked by stress 
alone, or are maintained by an exaggeration of the quantity variation. 
Thus, there being no significant qualitative change, the noun and verb 
forms of the following words are normally distinguished primarily by 
the situation of the pitch-prominent syllable; discount, import, 
impress, incense, increase, inlay, insult, offset, overhang—pitch pro- 
minence on the first element for nouns, on the second element for 
verbs. In other cases, some speakers may make a reduction or obscura- 
tion of the vowel quality in the first syllable in the case of the verbal 
form, e.g.:— 


Noun/Ad)jective Verb 
abstract /'geb streekt/ /¡eeb'streekt/ or /ab'streekt/ 
accent /'eek sent/ or /'eeksnt/  /,eek'sent/ or /ak'sent/ 
digest /'da1d3est/ / dar d3est/ or /di'd3est/ 
torment /'toz,ment/ / to: ment/ or /ta ment/ 
transfer?  /'trazmsf3:/ / tra: ns'f3:/ or /trans f3:/ 
transport?  /'trazms¡po:t/ / tra: n'spo:t/ or /tran'spa:t/ 


(2) More commonly, the change in accentual pattern is manifested 
mainly by a shift in pitch prominence together with a related variation 
of quality. Often, the qualitative change takes the form of a reduction 
of the unaccented vowel of the first element of the verbal form to /3/ or 
/1/; less frequently, by a reduction of the vowel in the second element of 
the noun/adjective form, e.g.:— 


! The small number of disyllables involved in such accentual oppositions is shown by 
L. Guierre, L’accentuation en anglais contemporain (unpublished doctoral thesis), Paris, 
1979. Out of a corpus of more than 10,000 disyllabic words, only 85 exhibited changes of 
verb, noun or adjective function by means of a shift of accented syllable: 72 words oppose 
v. wn. (e.g. convict)—whereas in 186 cases (e.g. profit) functions were not distinguished 
by means of accent; 9 words opposed n/adj. ~ v. (e.g. present) through accent, while 
other oppositions of function through accent were exhibited in compact (n. ~ v./adj.), 
absent, direct, prostrate (adj. N v.). 

2 Also with /traen-/. 


236 An Introduction to the Pronunciation of English 


Noun/Adyjective Verb 
absent /'æbsənt/ /əb'sent/ 
combine /‘kom,bain/ /kom'bain/ 
compress _/ kom pres/ /kam'pres/ 
concert /'konsat/ /kan's3 t/ 
conduct /'kon,dakt/ /kan'dakt/ 
consort /'kon,so t/ /kan'so t/ 
contract /'Kon treekt/ /kon treekt/ 
contrast /'kon,tra st/ /kan'tra st/ 
convict /‘konvikt/ /kan vikt/ 
desert /‘dezat/ /diz3 t/ 
export /'ek,spo t/ /ik'spo t/ 
frequent /'fri kwont/ /frikwent/ 
object /"obd31kt/ /2b d3ekt/ 
perfect /'p3 fikt/ /pa fekt/ 
permit* /'p3 mit/ /pa mit/ 
present /'preznt/ /prizent/ 
proceeds /'prau,si dz/ /pra si dz/ 
produce /'pro dju s/ /pra dju s/ 
progress /'prau,gres/ /pra gres/ 
project /'pro,dzekt/ /pra dzekt/ 
protest /‘prau,test/ /pra test/ 
rebel /'rebl/ /ri bel/ 
record /'re¡ko d/ /riko d/ 
refuse /'re,fju s/ /rifju z/! 
segment /'segmoant/ / seg ment/ 
subject /'sabd3ikt/ /sab'd3ekt/ 
survey* /'s3 ,vei/ /sa ve 
August /3 gəst/ (n.) he ‘gast/ (adj.) 
minute /minit/ (n.) / ¡mal nju t/ (adj.) 


(* In these words, the opposition of /3 / and /93/ in the first syllable 
amounts to an opposition of vowel quantity; it is to be noted that the 
verb fo survey may have the same accentual pattern as the noun in the 
particular sense of ‘to carry out a survey’.) 

Several disyllables do not conform to the general noun/verb accen- 
tual pattern distinction or exhibit instability, e.g. comment / koment/ 
for both noun and verb; contact /‘kon,tzekt/ (n.) and /'kon,tekt/, 
/ kon teekt/ or /kan'tekt/ (v.); detail /'di ,teil/ (n.) and /'di ,teil/ or 
/ di 'teil/ (v.); contrast has a verbal form /'kon,tra st/ in addition to 
the more usual form given above. In all these cases, and in that of 


! The noun and verb forms of refuse differ also in the final consonant and the resulting 
variation of vowel length of /u: /. 


The Word 237 


dispute, already mentioned, the noun form is tending to supersede the 
verbal pattern. 

Words containing more than two syllables also exhibit distinctive 
accentual patterns associated with pitch, stress, and quality cues, e.g.: 


Noun/ Adjective Verb 
alternate Felner nət/ /'9:lto,ne1t/ 

alisimel 
associate /a'sausjat, -slot, -fjat/ /a sausielt, ə səufrert/ 
attribute / etribju: t/ /ə trbju: t/ 
envelope /'enva loup/ /m'veləp/ 

/'envələp/ 
reprimand /'teprıma:nd/ / reprrma:nd/ 
interchange  / antat feind3/ / inta tf eind3/ 
compliment /komplimant/ /,kompli ment/ 

/‘kompli,ment/ 
supplement /'saplimant/ f ri 
| 

estimate /‘estimat/ /‘estimeit/ 
intimate /‘intimat/ /‘inti,meit / 
moderate /'‘modorat/ /'modo,rert/ 
prophesy /'profasi/ /'profi,sal/ 
separate /'separat/ /'sepo,reit/ 
invalid /‘invalid/ (‘sick’) 


/in'veelid/ (‘void’) 


9.6 Elision 


Since OE, it has always been a feature of the structure of English words 
that the weakly accented syllables have undergone a process of grada- 
tion, 1.e. loss of phonemes or obscuration of vowels (see also $87.9.12, 
7.13). The same process of gradation, with resultant contraction, 
may be observed in operation in PresE. It is important, however, 
to distinguish between cases of elision which have been established in 
the language for some time (although the spelling may still reflect an 
earlier, fuller form) and those which have become current only recently. 
In these latter cases, the forms exhibiting elision and syncope are typical 
of rapid, colloquial speech, whereas more formal speech tends to retain 
the fuller form under the preservative influence of the spelling. The 
examples of elided word forms in colloquial speech which are given 
below are independent of the type of reduction affecting particular 
words and syllables under weak accent in connected speech (see §10.4). 
They are also to be distinguished from the reduced forms in various 
types of regional and popular speech, which the educated speaker might 
characterize as vulgar, e.g. recognize as /reka,naiz/, satisfactory as 
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/ sees feektri/, cigarette as /sigret/, possible as /'posbl/, Waterloo as 
/ wo: luz /, lovely as /'lalv/, etc. 


(1) Vowels! 

(a) Established—Loss of vowels under weak accent within the word 
has occurred at various stages of the language’s development and 
is now established, e.g.—initially, state, scholar, sample; medially, 
Gloucester, marriage, halfpenny, evening, chimney, curtsey, forecastle 
/'fauksl/, gooseberry, Salisbury /'so:\zbri/; in a final syllable, time, 
name, loved, hands (< handes), eaten, written, cousin. 

(b) Present colloquial.—In PresE such elision is likely to take place 
especially in a sequence of unaccented syllables, in respect of /a/ and 
/1/. Thus, in post-nuclear (post-tonic) positions, the sequence con- 
sonant + /a/ + /r/ + weak vowel, e.g. in preferable, tends increas- 
ingly to lose the /a/ of the second syllable, /far/ being reduced to /fr/; 
similar reductions affect /ar/ following other consonants, e.g. in 
repertory, temperature, comparable, territory, lavatory, temporary 
/‘tempror1/, anniversary, vicarage, category, factory, robbery, labour- 
exchange /‘leibriks,t feind3/, murderer / 'm3:dra/, customary, camera, 
honourable, scullery, suffering, beverage, rhinoceros, nursery, 
Nazareth, fisheries, treasury, natural /'næt fral/, dangerous, utterance, 
history, ordinary. Though labelled here as ‘colloquial’ these elisions 
may occur regularly within the speech of an individual, the fuller 
version not forming a part of his idiolect. In the same way, there may be 
an elision of a weak vowel following a consonant and preceding /l/, or 
the reduction of syllabic /I/ to syllable marginal /1/, in grappling, 
doubling, fatalist, paddling, bachelor, specialist, usually, insolent, 
easily, carefully, buffalo, novelist, family, panelling, chancellor, etc. 
Note, too, frequent loss of post-nuclear /a/ or /1/ in university 
/ju:niv3isti/, probably /'probbly, difficult /'‘difklt/, national 
/‘neefnl/, fashionable /'feefnabl/, reasonably /'rizznabl/, parliament 
/‘pa:lmant/, government /'gavmant/. A similar process may apply 
with the loss of syllabicity in the present participles of verbs such as 
flavour, lighten and thicken where the /a/ may be elided or the syllabic 
consonant /r/ or /n/ replaced by a non-syllabic consonant marginal to 
the syllable. Thus /flewrin/, /‘laitnin/ and /‘@iknin/ in place of 
/‘fleivarin/, /'laitonin/ and /'@ikanin/ respectively. It may be noted 
that some speakers make a regular distinction between the participle 
with three syllables and the noun of two syllables exhibiting elision. 

In pre-nuclear positions, /3/ or /1/ of the weak syllable preceding the 
primary accent is apt to be lost in very rapid speech, especially when the 


'For absorption of the 2nd element of a diphthong before another vowel (smoothing), 
see §7.11. 
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tonic syllable has initial /1/ or /r/,' e.g. police, Palladium, parade, 
terrific, correct, collision, believe, balloon, barometer, direction, 
delightful, gorilla, galoshes, ferocious, philology, veranda, 
voluptuous, Saloon, Solicitor, syringe, charade, etc.; also, with a con- 
tinuant consonant preceding and a consonant other than /l/ or /r/ 
following, in phonetics, photography, thermometer, supporter, 
suppose, satirical, circumference, perhaps /pheeps/, etc. Note, too, 
the elision of /1/ (common amongst schoolchildren) in geometry 
/‘d3pmatri/, geography /‘d3vgrafi/. 


(2) Consonants 

(a) Established—Apart from such consonants as have been lost 
through vocalization, e.g. OE hlaford > lord, OE wealcan > walk 
/wo:k/, or have lost allophones in certain situations, e.g. /h/ = [x, ç] 
in brought, night, the reduction of many consonant clusters has long 
been established, e.g. initial /wr, kn, gn, hl, hr, hn/ in write, know, 
gnaw, loaf, ring, nut; medial /t/ + /n/ or /I/ in fasten, listen, often, 
thistle, castle; and final /mb, mn/ in lamb, hymn, etc. 

(b) Present colloquial.—In PresE simplification of clusters 
continues to take place, especially involving the loss of the alveolars 
/t, d/ when medial in a cluster of three consonants. In the following 
examples, retention of the alveolar plosives is characteristic of careful 
speech: exactly, facts, mostly, handsome, windmill, handbag, wrist- 
watch, friendship, kindness, landlord, lastly, restless, landscape, 
Westminster, coastguard, dustman, perfectly (and more rarely /t/ in 
such words as attempts, prompts). /8/ is normally elided from asthma 
and isthmus but, like /f/ and /8/, may sometimes be omitted from 
months, twelfths, fifths, cloths; and in rapid speech elision of /k/ in 
asked and /l/ in only may occur. 

[t] is apt to be lost when preceded by /9:/, which has a resonance 
similar to that of [t], e.g. always /'o: wiz/, already / 9:redi/, although 
/,9: 6av/, all right /'9: rait/, almanac /'9: ma,nek/ (cf. established loss 
of [t] in walk, talk, etc.). 

/p/ may be lost in clusters where its position is analogous to that of 
an epenthetic plosive e.g. glimpse /glims/. 

Whole syllables may be elided in rapid speech, especially in the 
vicinity of /r/ or where there is a sequence of [r] sounds, library 
/'aibri/, February /'febri/, literary /litri/, meteorological 
/ mi: tra lod31k1/. 


1Such elisions in word initial syllables are more likely when the preceding word, 
belonging to the same group, ends in a vowel, e.g. the police /da:pli:s/, I believe /al 
bli: v/, but local police /laukl pali:s/, can’t believe /‘ka:nt bali: v/. 
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9.7 Word Accent—Advice to Foreign Learners 


The rhythmic pattern of a word, and especially the situation of the 
syllables carrying the primary (tonic, nuclear) accent and the pretonic 
secondary accent, should have as much importance attached to It as to 
the sequence of phonemes composing the word. Despite the fact that an 
English listener will tend to interpret a distorted sound or accentual 
pattern (as when a foreigner pronounces develop as / devalap/) in terms 
of his own (correct) pattern and is alded in this adjustment by the 
meaning of the total context, 1t may nevertheless happen that a word 
pronounced with the correct sound sequence may well be misunder- 
stood if the relative prominence of the syllables 1s incorrect. Learners 
should be careful to make the appropriate reduction of unaccented 
vowels and are recommended to use a form with total elision of a weak 
vowel in such possible cases as are given in the preceding sections. They 
should pay particular attention to the role of variation of quality in 
those words which are distinguished from others by a shift of accent 
(see $9.5) without, however, extending such variation of accentual 
patterns indiscriminately to all disyllables, e.g. report, delay, select, 
reserve, account, etc., have the same pattern in both verb and 
noun/adjective functions. 

Moreover, the accentual pattern of an extended word form should 
not necessarily be associated with that of the root upon which it 1s 
based: compare, for instance, photograph /'fauta,gra: f/ with photo- 
graphy /fatografi/; nation /'neifn/ with national /'neefnal/, nation- 
ality /nefneeliti/, nationalistic / nefnalistik/; origin /'orid3in/ with 
original /a rid3anal/. 

It is possible to give rules governing the relationship of accentuation 
and the spelling of English polysyllabic words (see, for instance, 
Kingdon, 1958; Chomsky and Halle, 1968; Guierre, 1979), but these are 
of necessity extremely complex if they are to have general validity. The 
foreign learner will no doubt become accustomed to learning the accen- 
tual patterns of words as he meets them for the first time, but there are 
certain relatively simple rules (see Guierre, 1970; Fudge, 1984), involv- 
ing the influence of word endings on accentuation, which, though not 
without exceptions, have sufficient general applicability to serve as a 
useful guide to a learner. 

For instance, in the case of the following endings, the primary accent 
falls on the preceding syllable: 

-ity: ‘quality, nationality, university, etc. 
-ion, -ian: discussion, in'tention, per fection, ma'gician, negoti'ation, 
phone tician, etc. 
-ic: dra matic, emphatic, phonetic, hi'storic, etc. (Note, however, 
the exceptions such as ‘Arabic, ‘catholic, etc.) 
-ify (v.): ‘terrify, ‘justify, identify, etc. 
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-ible: in'credible, ‘terrible, i llegible, ON 
-igible: ‘negligible, 'dirigible, in telligible, etc. 
-ish (v.): finish, 'publish, 'flourish, etc. 

Again, verbs of 3 or more syllables ending in -ate, throw the main 
accent back two syllables, e.g. ‘penetrate, indicate, ne gotiate, etc.; 
whereas verbs of 2 syllables ending in -ate place the accent on -ate, e.g. 
in flate, trans late, de'bate, dic'tate, etc. 


9.8 Phonotactic Possibilities! 


English does not exploit, in the word and the syllable, all the possible 
combinations of its phonemes. For instance, long vowels and diph- 
thongs do not precede final /n/;* /e, e, a, D/ do not occur finally; the 
types of consonant cluster permitted are subject to constraints. 
Initially, /n/ does not occur; no combinations are possible with /tf, d3, 
0,z/;/r,j, w/ can occur in clusters only as the non-initial element; such 
initial sequences as /fs, mh, stl, spw/ are unknown, etc. Finally, only 
/1/ may occur before non-syllabic /m, n/; /h, r, J, w/ do not occur in 
the type of phonemic analysis here used; terminal sequences such as 
/kf, fp, 10, 3bd/ are unknown, etc. 

The Tables in §9.9 show the way in which consonants (singly and in 
clusters) combine with the vowel phonemes of English in initial and 
final positions of the word. Many more consonant sequences will 
naturally occur medially at syllable boundaries in polysyllabic words, 
but a knowledge of the tolerated word initial and final sequences will 
usually give a guide to the point- of phonological syllable boundary 
where onset of accent or other phonetic features do not supply the 
solution. Thus, in the pronunciation /‘netfrali/ (naturally), it is rea- 
sonable to assume that the syllable boundary falls between /tf/ and /r/, 
since /-zetf/ and /ra-/ are possible word final and initial sequences, 
whereas word final /-ee/ and initial /tfra-/ do not occur. 

However, although the general pattern of word initial and final 
phoneme sequences is plain, there are certain problems which have to 
be considered before comprehensive tables can be established. 

(1) It is necessary to decide upon the size and the nature of the word 
list which is to be analysed. In particular, a decision must be taken 
concerning the extent to which rare, archaic or foreign words and 
proper names are to be included. A difficulty arises only when such 
words provide examples of unusual combinations of phonemes. Then it 
is a matter of deciding whether the word is of such currency that it must 
be allowed to establish the possibility of a particular sequence of 


!See O'Connor and Trim (1953), and Hultzén, Allen, Miron (1964.) 
21 should be noted that such combinations do occur as a result of assimilation. See 
0 tee 
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phonemes, even though it may furnish the unique example. The Tables 
given below are based generally on the ttems contained in the Everyman 
English Pronouncing Dictionary (Dent, 1977). But the decision is often 
arbitrary, depending upon the native feeling for the ‘Englishness’ or 
otherwise of the sequence. Thus, Gwen and Sarah are used as examples 
of /gwe-/ and /sea-/ but the variant pronunciation /lo:ns/ for Launce 
is excluded as an example of final /-d:ns/; voodoo is allowed to 
exemplify /vu:-/, but fenks (which would provide an instance of final 
/-enks/) and zoril (/zb-/) are excluded on the grounds of rarity; both 
Schweppes and schnapps are discarded as unique evidence of /fw-/ and 
/fn-/ because of their restricted use, nor do tsar, tsetse illustrate initial 
/ts-/ since both have variant pronunciations with /z-/ or /s-/ and /t-/ 
respectively which conform to the more usual English patterns. 

(2) It frequently happens that a word has more than one accepted 
pronunciation. Usually, as in the common variation between /a/ and 
/1/ in certain unaccented syllables, the variants conform to the general 
pattern of combinatory possibilities. Occasionally, however, one such 
varlant pronunciation will provide a unique example of a phoneme 
combination. Thus, the choice of /u/ or /u:/ in groom provides the 
single instance of /gru-/ (if Grundig is excluded); /gru-/ is included tn 
the Tables as tolerable in the pattern of English /gr + V/ sequences. On 
the other hand, width, breadth, hundredth have variants with /-t8/ or 
/-40/; but the latter is excluded in favour of the (more common) 
former, since these words are the only cases of /V/ + /t-d/ + /0/. 
Again, words such as French, range may be said with either /-ntf/ and 
/-nd3/ or /-nf/ and /-n3/. Since the former seem to be most common, 
the latter are excluded as final CC clusters. On the other hand, though 
many speakers would not distinguish the final clusters in pririce /-ns/ 
and prints /-nts/, the distinction is sufficiently widespread for both 
/-ns/ and /-nts/ to be considered as possible final clusters (see also 
§8.4.4). 

(3) Phoneme sequences have different values according to their 
frequency of occurrence in normal discourse. Thus, /gl- / occurs 
before most vowels in a large number of common words, whereas /gj-/ 
is highly restricted (e.g. in the rare gules); similarly, /-ld/ is extremely 
common after most vowels, whereas /-lm/ occurs only after /1, €, A/ 
(the single example of /-ilm/, film, having a high likelihood of 
occurrence, the sole case of /-alm/, culm, being comparatively rare). 

(4) Final sequences which include a syllabic consonant, e.g. in 
rhythm /-19m/, mutton /-atn/, cattle /-cetl/, muddled /-adld/, have 
the syllabic analysed as /9 + C/ for the purpose of the Tables. 

(5) The greatcr complexity of final consonant clusters (as compared 
with those occurring initially) is largely accounted for by the fact that 
final /t, d,s, z/ often represent a suffixed morpheme, e.g. -ed, -s, -’s, as 
in parked, plagued, dukes, leagues, chef ’s ctc. 
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9.9 Tables of Word Initial and Final Phoneme Sequences 
CS 


The following ten vowels constitute monosyllabic words: /i:/ E, /a/ a, 
/a:/ are, /9:/ or, /3:/ err, /e1/ A, /ai/ I, /a0/ owe, /19/ ear, /e3/ air. In 
addition, /i:/ and /u:/ occur as reduced forms for he and who, and 
/31/ may occur in the exclamation Oy! 


(2) Initial V 


All vowels occur initially, with the exception of /U/ and /ue/.' 


(3) Initial CV-(Table I) 


/m/ does not occur initially. /3/ occurs initially before /1/, /1:/, /e@/ 
and /a:/ in such foreign words as gigolo, gigue, Zhivago, jabot, genre. 
The other consonants generally occur before all vowels, though marked 
deficiencies are evident before /ua, u, 91/. The following show more 
than 3 empty vowel slots: /z/ (12), /0/7 (9), /0/ (6). 


(4) Initial CCV- (Table I) 


(a) Initial CC clusters pattern as follows:— 


p +l,r,] 
t + Tr,],w 
k +1,r,J, w 
b+lr,] 
d+ r,jJ,w 
g +1l,r,], w 
m + j 
n + j 
l + j 
f +l,r,j 
Vv + j 
0 + Tr,J,w 
s +l, j,w,p,t,k,m,n,f 
f+ r 

h + j 


(b) /Cj/ occurs only before /u:/ or /ua/ or occasionally /u/. Initial 
/tw, dw, gw/ occur before only a restricted set of vowels; /hw/ is 
excluded as an initial RP sequence, the variant /w/ being more 
common. Only /sp/ occurs before all vowels; only CC + /u:/ (6) has 


! However, there are well-established pronunciations of such foreign proper names as 
Uppsala /upsa:la/ and Urdu /‘vadu:/. 


Word Initial CV- 


Table I 


al 31 JU au 19 €93 Ud 


el 


- 


+ + 
+ + 
+ + 
+ + 
+ + 

E 
+ + 
+ + 
+ + 
+ + 
+ + 
+++ 
+++ 
+ + 
+ + 
+ + 
+ + 
+++ 
+ + 
+++ 


— mo. 


J 
w 


Note:—The following items are included: kursaal /kuə-/, July /d3u-/, lure /lua/, Ruritania /ruə-/, voodoo, Thetis, Thark, thane, sourdine 


/sua-/, Sarah, Zambia, jalousie, genre, houri, Yare. 
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less than 10 empty slots, whereas /€9, oI, 3:, 19, 9, at, aU, Ud/ are 
deficient in more than 20 places. 


(5) Initial CCCV- (Table ITI) 
(a) Initial CCC clusters pattern as follows:— 


s+p+lr,]) 
+t+  r,]J 
+ k+l,5r,], w 


(b) /s/ is the essential first element of CCC clusters; the second 
element is a fortis stop; the third element must be one of /l, r, j, w/. Of 
the 12 potential CCC sequences /spw-, stl-, stw-/ do not occur. /CCj/ 
occurs only before /u:/ or /ua/; /skl-/ occurs only before /13/, though 
the item sclerosis admits the variants /skle-, skli-, sklo-/. 

(c) The name of the bird smew provides a single example of the initial 
sequence /smj-/. 


(6) Final -V 


All vowels except /e, æ, A, D/ occur finally. /u/ occurs word finally in 
to before a word beginning with a vowel or in pre-pausal position (as 
well as in certain pronunciations of a word like statue). No short vowels 
occur in stressed open syllables. 


(7) Final-VC (Table IV) 


/rt,h,j, w/ do not occur finally in the present phonemic analysis of RP. 
/3/ occurs finally only after /1:, az, uz, e1/ in words of recent French 
origin; /n/ only after /1, ee, a, D/; /0/ only after /1,1:, uz, el, al, 3U, au/; 
/g/ only after /1, €, 2, A, D, it, 32, az, 92, uz, el, 3U/. Only /d/ occurs 
after all vowels. No vowel occurs before all consonants; /Ua/ (17), /ea/ 
(16), /19/ (15), /91/ (12), /u/ (12) show more than 10 empty slots. 
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Table IV Word Final -VC 
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Note:—The following items are included: yoick, gowk, Prague, morgue, fugue, putsch /-utf/, hooch, doge /daud3/, Jan, bairn, coif, gaff, rev, 


hath, doth, vermouth /-39/, bourse /-uas/, fez, tarboosh, douche, gauche. 


Table Y Word Final -VCC 
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(8) Final - VCC (Table V) 


(a) Final CC clusters pattern as follows: — 


p+ tt, ES 

t + G25 
k+ t, S 
b + d, Z 
d + Z 
g + d, Z 
tf+ t 

d3+ d 

m+ p, d, E Goer 7 
n+ t, d, tf, d3, 0, Ss, Z 
j + E e A Z 
l + p,t, k, b,d, tf, d3, m, n, f, v, 8, s, z, f 
f + tt, 0, S 
v + d, Z 
0 + t, S 
Oo + d, Z 
s +p,t,k 

Z + d 

f+ t 

S d 


(b) in the present analysis of RP /r, h, j, w/ do not combine with 
other consonants in final positions. /g, 9, 0/ do not occupy final 
position in a final CC cluster. /p8/ occurs only after /e/, /m8/ only 
after /o:/, /In/ only after /1/, /1f/ only after /e/, /f8/ only after /1/. 
Only /dz/ occurs after all vowels; /nz, lz/ occur after all but 2 vowels. 
All other CC clusters show considerable restrictions in their ability to 
combine with preceding vowels. 

Vowels also show deficiencies in their ability to combine with the 59 
possible final clusters; none combines with all clusters. The following 
combine with less than half of the final CC clusters: /13/ (with only 2), 
mee (2), /03/ (5) 707 (8), 731/ (10), Zau/ (17), 79/7 (23), 73:7 (25), 7a1/ 
25): /3U/ (26), 71:7 (27), /u:/ (27), /e1/ (28), /a: 7 (28). The highest rate 
of combination with CC is shown by /1, €, A/. 
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(9) Final-VCCC (Table VI) 


(a) Final VCCC clusters pattern as follows:— 


E p +s 
[E y t +s 
k + tt k +s 
mop, f d +s 
n+ t, 0 >+s m+ p +t 
yn + k n +S, tf 
| +p,t,k,f,98 n +s, k 
f + ftt, 8 l + 8, p, k, tf 
s +p,t,k Soe E 
n+ d n + d3, Z 
l ore | + d3, m, v Yta 
k + S 
= iE 
ok +0 
|l + f 


(b) Only /nts/ (12), /ndz/ (11), /sts/ (15) cluster with more than half 
the possible preceding vowel phonemes. CCC clusters follow predomi- 
nantly short vowels. 11 of the 49 CCC final clusters occur after only one 
vowel (5 after /1/, 4 after /e/, 1 after /aA/, 1 after /3/). 


(10) Final - VCCCC 


Final CCCC clusters occur only rarely as a result of the suffixation to 
CCC of a /t/ or /s/ morpheme, e.g. /-mpts/ prompts, exempts, 
/-mpst/ glimpsed, /-Ikts/ mulcts, /-lpts/ sculpts, /-lf8s/ twelfths, 
/-ksts/ texts, /-KsBs/ sixths, /-nt8s/ thousandths. 


(11) With a vowel inventory of 20 items and the possible initial and 
final consonant clusters given inthe Tables above, it is clear that a large 
number of potential combinations are not utilized. Thus, such unused 
monosyllabic words as the following conform to an already existing 
pattern: /faud, said3, momp, bru:tf, plizk, splak, stred3/. If, in 
addition, deficiencies in the Tables are made good on the grounds of 
general patterning, it would be possible to construct words of an 
English phonological character with, for instance, initial /tfu-, r3:-, 
gloi-, skia-, sprau-/ or final /-9:g, -aitf, -u: nt, -a:nd3, -akst/, etc. 


9.10 Suffix Formation 


A limited amount of information has already been given (§9.3) on the 
effect on a word’s stress-pattern of the addition of certain suffixes. 
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These examples mostly concern derivatives and frequently involve 
changes in word-class. Inflectional endings (which do not normally 
affect stress) follow certain rules which affect segmental aspects of pro- 
nunciation. The following regularities may usefully be listed here. 


(1) Past tense formation 


For regular verbs in which the past tense is signalled by the addition of 
an -ed ending, the following rules of pronunciation apply: 
(a) If the stem ends in /t/ or /d/, add /1d/ 
e.g. exclude /ik'sklu:d, ik sklu:did/; guard /ga:d, 'ga:did/; rot 
/rot, rotid/; target /'ta: git, 'ta: gitid/ 
Otherwise 
(b) If the stem ends in any lenis sound (apart from /d/), add /d/ 
e.g. buzz /baz, bazd/; hammer /'heema, 'heemad/; kill /kil, kild/; 
listen /Usn, lisnd/ 
(c) If the stem ends in any fortis consonant (apart from /t/) add /t/ 
e.g. arch /a:tf, a:tft/; immerse /1m3:s, Ima: st/; kick /kik, kikt/; 
sniff /snif, snift/ 


(2) Plural formation 


(and 3rd person singular of present tense of verbs) 
(a) If the stem ends in a sibilant (a sound in which there is some 
element of fricative grooving), add /1z/ 
e.g. address /adres, adresiz/; arch /a:tf, ‘a:tfiz/; graze /greiz, 
'greiziz/; judge /d3ad3, 'd3ad31z/; rush /'raf, rafiz/ 
Exceptionally, the voicing of the fricative in house changes: /haus, 
‘hauziz/.! 
Otherwise 
(b) If the stem ends in any non-sibilant lenis sound, add /z/ 
e.g. blow /blau, blauz/; pattern /'peetn, 'peetnz/; regard /riga:d, 
riga:dz/; thrill /Oril, 6rilz/ 
(c) If the stem ends in any non-sibilant fortis consonant, add /s/ 
e.g. laugh /la:f, la: fs/; pick /pik, piks/; resort /ri'zo:t, rizo: ts/ 


(3) Present participle formation 


In all cases, add /1n/ 
e.g. kill /‘kil, ‘kilin/; laugh /la:f, la:fin/; sing /sin, sinim/; trim 
/trim, ‘trimin/ 
For cases where the stem ends with orthographic /r/, see (6) below. 
For stems ending in syllabic /n/ or M7, the syllabic nature of the 
nasal or lateral is frequently retained, e.g. handle / heendl, heendlin/ i 


l! In all comparable cases, such alteration is reflected in the spelling. 
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widen / waidn, waidnin/. However, some speakers may insert a/a, 
retaining the same number of syllables, thus /‘heendalin, waidanin/; 
whilst for others the nasal or lateral may lose its syllabic function, thus 
/‘heendhn, ‘waidnin/. It should be noted that in such cases, the quality 
of the /1/ is altered, the dark [t] of [hændt] being replaced by a clear [1]. 
(See also $9.6.) 


(4) Comparison of adjectives 


For those adjectives whose comparative and superlative degrees are 
formed by the suffixing of -er and -est respectively, the pronunciation 
of the stem remains unchanged except in the case of stems ending in /n/ 
or /r/ (see (5) and (6) below). Thus /a/ and /1st/ are regularly added, as 
in easy /i:zi, 'i:z19, ‘i:zust/; great /greit, greita, greitist/; big /big, 
'biga, bigist/. 


(5) Stems ending in /y/ 


When the comparative and superlative suffixes are added to stems 
ending in /n/, a /g/ is inserted, e.g. long /lon, ‘longa, longist/. 

In all other cases, the /9/ is followed immediately by the suffix, e.g. 
participle -ing as longing /'lonw/, adjectival modifier -ish as longish 
/'lon1f/, or agentive -er as hanger /'heeno/, singer /'sina/. 

It should be noted that monomorphemic words (not formed of a stem 
and affix) exhibit the sequence /-ng-/ intervocalically, e.g. anger 
/'engo/, finger /'fingo?. - 


(6) /r/- links in suffix formation ' 


In the case of words which end with orthographic r, an /r/-link is 
regularly inserted between the final vowel of the stem /9, 3:, a:,9:/ and 
any initial vowel of the suffix. 

e.g. present participles blur /bl3:, bl3:rıņ/; secure /s1kjva, 
si kjuarin/; stare /stea, 'stearin/; store /sto:, sto:rij/; comparative and 
superlative adjectives (stem + /a,1st/) clear /klia, ‘kliara, 'kliarist/. 

This process applies to derivational as well as to inflectional suffixes, 
e.g. adjectival -y, e.g. star /sta:, sta:ri/; agentive noun -er, e.g. 
murder /'m3:da, 'm3:dora/; verb-forming -ize, e.g. familiar /fa'milia, 
fa millaraiz/. 


9.11 Variability in the Phonemic Structure of Words? 


In connected speech, English words exhibit variations of accentual 
pattern and changes of a phonemic or phonetic kind, involving assimi- 


'This does not apply to past tense -ed (see l(b) above.) 
-See my ‘A note on the variability of the phonemic components of English words”, 
Brno Studies in English, vol. 8, 1969. 
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lation and elision, especially at word boundaries (see Chapter 11). 
There is also often a remarkable latitude in the choice of phonemes used 
in words when said in isolation by RP speakers. Even with the exclusion 
of cases of differing phonemic inventories—e.g. the choice between 
using /m/ or /w/ for wh words or /2:/ or /23/ in words of the bore 
type—there remains a high degree of variability within the same variety 
of pronunciation. The permissible variations concern mainly vowels, 
but cases of a choice of consonant also occur. The following are 
examples: — 


(1) Vowels 


/i:/ N /1/ ‘acetylene, economy’; ~ /e/ “economics, premature’; 
~ /ei/ ‘deify, detour’; ~ /al/ ‘Argentine, iodine’ 

/1/ y /e/ ‘alphabet, orchestra’; y /ai/ ‘privacy, dynasty’; ~ /el/ 
‘magistrate, holiday’; ~ /a/ ‘believe, system, adequate’ 

/e/ N /e1/ ‘again, maintain’; y /a/ ‘accent, emblem’ 

/æ/ N /a:/ “graph, translate’; y /e1/ ‘patriot, amoral’; 
~ /ea/ ‘larynx, pharynx’; y /a/ ‘agnostic, vapidity’ 

/A/ N /d/ ‘constable, combat’; y /a/ ‘bankrupt, dandruff’ 

/0/ N /d:/ ‘salt, wrath, Australia’; ~ /a/ ‘obscure, obligatory’ 

/a:/ ~ /Ua/ “sure, poor’ 

/U/ N /u:/ ‘Toom, groom’ 

lu /n /ju: / ‘suit, supreme, allusion’ 

/ei/ y /a:/ ‘data, esplanade’ 

/30/ y /3/ ‘allocate, phonetics’ 


(2) Consonants 


/t/ N /tj/ N /tf/ ‘amateur’; /tj/ y /tf/ “actual, Christian’; /dj/ ~ 
/d3/ ‘educate, grandeur’; /d3/ N /3/ ‘garage’; /g/ N /d3/ ‘gibberish, 
pedagogic’; /ntf/ y /nf/ ‘French, branch’; /nd3/ y /n3/ ‘revenge, 
strange’; /k/ y /kw/ ‘quoit’; /nk/ y /n/ ‘anxious’; /ng/ N /n/ 
‘English’; /sj/ N /si/ y /fi/ ‘associate’; /sj/ y /f/ ‘issue, sexual’; /zj/ 
~ /3/ “usual, azure’; /f/ y /3/ ‘magnesia, Asia’; /s/ N /z/ ‘usage, 
unison’; /f/ y /p/ ‘diphthong, naphtha’. 


10 


Connected Speech 


10.1 Accent 


Connected speech, ¡.e. an utterance consisting of more than one word, 
exhibits features of accentuation that are in many ways comparable 
with those found in the polysyllabic word. Thus the character of a 
connected utterance may be said to be determined both by a changing 
pattern of successive qualities and quantities and also by the relation- 
ship of its parts, 1.e. of the words composing the continuum. Some 
parts of the connected utterance will be made to stand out from their 
environment, in the same way that certain syllables of a polysyllabic 
word are more prominent than their neighbours; in both cases, accen- 
tuation has a contrastive function. Accentuation in connected speech 
differs, however, from the usual case of a polysyllable in that the 
situation of the accent in connected speech is determined largely by the 
meaning which the utterance is intended to convey. The meaningful, 
distinctive, function of accent in words is, on the other hand, of 
restricted application (89.5). The following selected examples illustrate 
some of the similarities and differences of accentual patterns in the 
word and in connected speech: — 


She can may be Y o /fi: ken/or. & /fikzen/ (cf. the varying 
patterns of insu/t as verb or noun), 
but not Y . / fi: kan/, as in beacon. 
She can go may be Y . e (cf. telephone) 
> o (cf. tobacco) 
e . y Or. . (cf. cigarette) 
butnot Y . . (cf. bachelor). 
How can she? may be y . . (cf. quantity) 
e y . (cf. uncertain) 
butnot , y . (cf. encounter). 
How can she do it? may be e . . Y . (cf. interdependence) 
é > a Cl aistocacy) 
but not , y... . (cf. administrative). 
We put the case in the hall 
maybe. e . e. . . >) 
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o + nd * 


¡A , etc., 
but not. e . . . ẹ . (cf. industrialization) 
with no accent on case and hall. 


o. "o. . e 


oe y. 
èe y C o o 


He understands the importance of practising—where understands, 
importance, practising, must have an accent of some kind on their 
appropriate syllables, whether or not he, for instance, is accented. 

Such examples suggest that: — 

(1) Although accentual patterns of connected speech are freer than 
those of the word and are largely determined by the meaning to be 
conveyed, some words are predisposed by their function in the language 
to receive accent. These content or lexical words are typically main 
verbs, adverbs, nouns, adjectives, demonstrative pronouns, etc. Other 
categories of words, such as auxiliary verbs, conjunctions, preposi- 
tions, pronouns, relative pronouns and articles (form or grammatical 
words), are more likely to be unaccented, although they, too, may be 
exceptionally accented if the meaning requires it. 

The examples given above illustrate the freedom of accentual 
patterning in utterances taken without a context. But the meaning of 
any utterance is largely conditioned by the situation and context in 
which it occurs. Thus, it must be expected that the freedom of accentual 
patterning of the utterance and, in particular, of the situation of the 
primary (tonic) accent will be considerably curtailed by the constraints 
imposed by the contextual environment. In the case of an opening 
remark, or when a new topic is introduced into a conversation, there is 
greater scope for variations of meaning pointed by accentuation. Or 
again, it happens that some accentual freedom Is possible in responses; 
thus, in response to the statement She came last week( . e e ò ),an 
incredulous reaction Last week! might have the pattern Y o ! (i.e. 
“Wasn't it the week before?”) or e Y (i.e. ‘Don’t you mean last 
month?’); or, in response to What was the weather like?, the reply /t 
rained every day might have the pattern . e e . 3 (emphasizing 
the continuous nature of the rain)or . Y o . e(wherethe fact of 
raining is emphasized). But, in the following dialogue, such constraints 
are placed upon the accentuation, both by the context and by the nature 
of the content words, that little variation is possible (those words likely 
to be most strongly accented being printed in italic):— 


l Note that the sign Y here shows the place of the primary, tonic accent. lt may be 
marked by the onset of a falling or, as here, a rising change of pitch direction. 
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‘Did you have a good holiday?’ 

‘Yes, very good.’ 

Was the weather all right?’ 

‘It was fine for the first part, but for the rest of the time it was pretty 
mixed. We enjoyed ourselves though. We had the car, so we were able 
to do some sight-seeing, when it was too wet to go on the beach.’ 


(2) Many monosyllabic form words are subject to qualitative varia- 
tion according to whether they receive the accent or.not. Nevertheless, 
weakening of an unaccented form word may be inhibited by its 
position, e.g. can /kan/ or /kn/ in unaccented initial and medial 
positions in the utterance, but /ken/ in an unaccented final position. 
On the other hand, monosyllabic content words in relatively weakly 
accented positions (adjacent to a strong accent) retain their full vowel 
value and the prominence associated with it, e.g. how in How can she? 
(e y .); or again, in We put the case in the hall, case and hall will 
always receive a secondary accent, wherever the primary accent Is con- 
centrated, because of the qualitative prominence of their full, strong 
vowels (and, frequently for the rhythmic beat which they carry), 
whereas put with its relatively weak vowel /U/ may lose all accentual 
prominence, e.g. in the second pattern given for this sentence on p. 261. 

(3) Morethan one word in an utterance may receive a primary (tonic, 
nuclear) accent. A deliberate, emphatic, or excited style of speaking 
often exhibits a proliferation of primary, tonic accents; a more rapid 
and matter-of-fact delivery is likely to show fewer primary and more 
secondary accents. 

(4) Polysyllabic words, containing one accented syllable when said 
in isolation, carry an accent (primary or secondary) on that syllable in 
connected speech. 

(5) In an extended dialogue in normal conversational style, the 
number of weak syllables (unaccented) tends to exceed that of those 
carrying an accent (primary or secondary). 


10.2 Manifestation of Accent in Connected Speech 


The same factors which we saw to be responsible for achieving a pattern 
of prominences in the word, 1.e. variations of pitch, length, stress, and 
quality, contribute to the manifestation of the accented parts of 
connected speech. Just as in the case of the word, it is necessary to 
distinguish those factors which are significant realities only for the 
speaker from those which, being perceived and interpreted by the 
listener, are significant in the function of speech as an act of 
communication. 
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10.3 Stress and Rhythm 


The speaker is aware of the occurrence in the utterance of a number of 
strong stresses or beats corresponding to those parts of the utterance to 
which he wishes to attach particular accentual meaning and on whichhe 
expends relatively great articulatory energy; the remaining words or 
syllables are weakly and rapidly articulated. The number of syllables 
stressed by the speaker depends largely upon the nature of the words 
composing the utterance. Thus, an utterance containing a high per- 
centage of content words is likely to receive more stresses than one with 
the same number of syllables but a higher proportion of form words: 
compare, for example, two sentences of eight syllables: — 


The first six have all won a prile—. e e . e. o. . Y 
There were prizes for six of them—. . e. . LVL. . 


Stress of this kind constitutes a reality for the speaker, but is not, by 
itself, an efficient means of communicating accent in connected speech. 

Nevertheless, the syllables uttered with the greatest stress constitute, 
for the speaker, hubs with which unstressed syllables will be associated 
to form rhythmic groups.! It is a feature of English that the utterance is 
delivered as a series of close-knit rhythmic groups, which override in 
importance on the phonetic level the significance of the word on the 
linguistic level. Compare the rhythmic groups, based on a strongly 
accented syllable, which are likely to occur in the following sentences: 


















































They couldn't have | chosen a | better | time for their | holiday 

. 3 o . @ . o O o G . . . . 

I want to | I want | to go Iwant | to go | now 

. Y o o © e y e ® € 

I want | togo | to-morrow | 

e O e O 2 ` o 

Mary and | George | specially | want | to go | to-morrow 
© o © Cd] 0 9 ® o O e X e 














It is noticeable that the rhythmic beats of an utterance occur at fairly 
equal intervals of time.? As a result of this, the speed at which the 


lSee W. Jassem, ‘Stress in Modern English,’ Bulletin de la Société Polonaise de 
Linguistique, Cracow, 1952; Intonation of Conversational English, Wroclaw, 1952; 
‘Indication of speech rhythm of educated southern English,’ Le Maitre Phonétique, 
no. 92, 1949. 

21t should be emphasized that such isochrony is of a very approximate kind, J. D. 
O’Connor has shown (‘The perception of time intervals,’ Progress Report, 1965, 
Phonetics Laboratory, University College, London) that even when a speaker is 
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unstressed syllables are uttered—and the length of each—will depend 
upon the number occurring between the strong beats. All the unac- 
cented syllables occurring between two strong beats may not, however, 
be uttered with equal rapidity: those following the strong beat of a 
rhythmic group tend to occupy slightly more time than those which 
precede the strong beat. Thus, in the phrase The authority | of the 
government | is in danger, particular speed of utterance is associated 
with the unaccented of the, is in, compared with -ity, -ernment. The 
rhythmic grouping of unaccented syllables generally correlates with 
grammatical word-clusters; a rhythmic division will not normally fall 
within a word pattern, but several words may combine to form one 
rhythmic group. It often happens, however, that an unaccented word 
may equally well be assigned to either of two rhythmic groups, e.g. in 
They couldn’t have chosen, the weak have /hav/ may be the last syllable 
of the first group or the first syllable of the second group. 

Syllables carrying a strong stress are usually signalled to a listener by 
means of pitch prominence (see §§10.5, 10.5.1), but it frequently 
happens that asyllable stressed by the speaker has no pitch prominence. 
In this case, the syllable (or word) is given some accentual prominence 
for the listener by a combination of any of the factors of loudness 
(relating to stress), sonority (relating to the quality and length of the 
syllabic sound), and the nature of the word (‘content’ as opposed to 
‘form’). The following sentences illustrate the occurrence of stressed 
syllables said on a monotone within an utterance or, following a pri- 
mary accent initiating a rise, within a rising post-tonic sequence of 
syllables; such stressed syllables, without pitch prominence, may be 
said to carry for a listener secondary accent of a rhythmic, qualitative/ 
quantitative, or semantic kind: — 


seems 1n : o : e ae 
it seems impossible 


wants, have in ; ə : ocean 
he wants to have it now 





deliberately attempting to produce isochrony in doggerel verse the real durations of 
groups vary between 488 and 566 msecs and that a listener’s perception of equality of 
intervals between stimuli allows for considerable latitude. O’Connor also demonstrates 
(‘The duration of the foot in relation to the number of component sound segments,’ 
Progress Report, 1968) that, in a constant frame with a variable item having different 
numbers of segments, the frame items do not accommodate appreciably to the length of 
the variable. The variable shows greater length according to the number of its segments, 
though the duration is not evenly proportional to the number of segments. All that can be 
said is that there is a tendeney towards compression and isochrony as the number of 
segments increases. 
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forty in a a E 


it was in nineteen forty-nine 


(cf. fifty in re a A 


1t was in nineteen fifty-nine 


where the stressed /1/ of fifty may not have the same qualitative 
prominence as the stressed /: / of forty) 


C a ŮĖŮĖŮĖĖ—— 
ere 


said, go in Wale N aa : e 
they all said they would go 


(where the vowels of both cases of they are likely to be weakened) 


o ° re e o 
understand, say in E 


does he always understand what you say? 


(For variations of word accentual patterns in connected speech, see 
Zo) 


10.4 Quantity and Quality 


(1) Accented words.—Vowels and continuant consonants in accented 
syllables which form the hub of a rhythmic group are shortened accord- 
ing to the number of unaccented syllables (especially following) in the 
group. Thus, the /a1/ of /taid/ (tide) shows progressive shortening in 
such rhythmic groups as tidy, tidily, she tidied it, etc.; the /A/ and /m/ 
of /kam/ (come) are similarly shortened in comfort, comfortable, 
come for me, circumference. Or again, a comparable phonemic 
Sequence will have slight variations of sound length according to the 
division into rhythmic groups: cf. aboard a liner /ab3:d a laina/ and a 
border-liner /a bo:da ,laina/—/d:/ being slightly shorter in the second 
case; minor official /'mamar afifl/ and mine are official /‘main 
ara fifl/—/ai/ being longer in the second case. Such variation of 
rhythmic grouping, involving changes of quantity, constitutes a reality 
for the speaker, but it is doubtful whether slight modifications of this 
kind are markedly significant to a listener, since the choice of meaning 
for such similar phoneme sequences is normally determined by the 
context, such cues as are provided by rhythmic variation of quantity 
being redundant. 

(2) Unaccented words.—A more marked effect is that which charac- 
terizes the quantity and quality of unaccented words. Content words 
(monosyllables and polysyllables) generally have in connected speech 
the qualitative pattern of their isolate form and therefore retain some 
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measure of qualitative prominence even when no pitch prominence is 
associated with them and when they are relatively unstressed, e.g.:— 


why, try, jump in O eee 
why don’t you try and jump? 





nuisance in A ic O 
what a nuisance 


o ° e ° o 
nearly, quarter in de lA 


nearly quarter to seven 


But many form words have two or more qualitative and quantitative 
patterns according to whether they are unaccented (as is usual) or 
accented (in special situations or when said in isolation). As compared 
with the accented realizations of these words (the ‘strong’ forms), the 
unaccented (‘weak’ form) varieties of these words show reductions of 
the length of sounds, obscuration of vowels towards /3, 1, U/, and the 
elision of vowels and consonants. The following list of examples pre- 
sents the most common of these words, first in their unaccented 
(normal) weak forms and secondly in their less usual! accented strong 
form:— 


Unaccented Accented 
a Jor /ei/ 
am /m, om/ /eem/ 
an /n, on/ /æn/ 
and /ənd, nd, ən, n/ /eend/ 
are /3/ + consonant A 

/or, r/ + vowel Jar 
as OZ /2ez/ 
at /at/ /et/ 
be /b1/([b1]) /b1:/ 
been /bin/ /b1:n/ (also 


/bin/ for some 
speakers) 


l The following 42 items occur in the first 200 most common words in connected 


speech: the, you, 1, to, and, a, that, we, of, have, is, are, for, at, he, but, there, do, as, be, 
them, will, me, was, can, him, had, your, been, from, my, or, she, by, some, her, his, us, 
an, am, has, shall. lt is significant that of these the following 19 have over 90% 
unaccented occurrences with a weak form: at, of, the, to, as, and, or, a, his, an, but, been, 
for, her, we, be, shall, was, them. Other items of rarer occurrence, such as than, only 
exceptionally occur with the strong form. 
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Unaccented Accented 
but LBS /bat/ 
can (aux.) /kon, kn/ /keen/ 
could /kad, kd/ /kud/ 
do (aux.) do daa /du:/ 
does (aux.) daz. 25 /daz/ 

(e.g. What's (= does) he like? /'wotsi: 'laik/, 
When’s (= does) he arrive? /'wenz i: ə'raiv/) 
for /fa/ + consonant ioe 

/far, fr/ + vowel PODES 
from /from/ /from/ 
had (aux.)' "had. sata. /heed/ 
has (aux.)! Anoz, 3z; Z, S /hæz/ 
have (aux. )! hov, 3v, V~ /hæv/ 
he ¡Miss (bi Di 
her ¿LISO = /h3: / 
him /im/ /him/ 
his GS /hiz/ 
iS AS Zi /1Z/ 
me /mi/ ((mi)) /mi: / 
must /maost/?* /mast/ 
not /nt, n/ /not/ 
of LOVIN S /DV/ 
Saint /sant, snt, san, sn/ /semt/ 
shall E /feel/ 
she /fi/ (ffi) A 
should /fad, fd/ /fod/ 
Sir /sa/ + consonant /83: / 

/sor/ + vowel /s3:1/ 
some 
(unspecified quantity) /sam, sm/ /sam/ 
than /ðən, On/ /ðæn/ (rare) 
¿hat (conj. and rel. 

pron.)* 7 Oo; /Oet/ (rare) 


| The distinction between auxiliary have and the main verb may be illustrated by exam- 


ples such as He considered what he /had/ left (weak form of auxiliary verb = what had 
been left by him) and He considered what he /hed/ left (main verb, non-weakenable = 
what he still possessed). 

2A weak form with /h/ would normally be used when unaccented but following a 
pause. 

3 Often /mas/ with elision of /t/. See §11.3.6. 

4 That as a demonstrative adjective or pronoun is always accented, e.g. that man / det 
meen/, that’s the one / daets da wan?/. 
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Unaccented Accented 
the /91/ ([91]) + vowel /ði:/ 
d3/ + consonant A 
them /ðəm/ also /am, m/ /ðəm/ 
there (indef. adv.) /0a/ + consonant /ð£ə/ (rare) 
/ðər/ + vowel /Oear/ (rare) 
to /ta/ + consonant /tu:/ 
/tu/ + vowel K 
us /as, s/ /AS/ 
was /waz/ /woz/ 
we /wi/ ({wi]) /wi:/ 
were /wa/ + consonant /w3:/ 
/woar/ + vowel /w3ir/ 
who /hu, uz, U/ ([hu))? /hu:/ 
will ae /wil/ 
would [Wad dia /wud/ 
you /\0/ ({jul) /ju:/ 


It should be noted that verb forms such as am, are, be, can, could, 
do, does, had, has, have, is, must, shall, was, were, will, would retain a 
strong form when they occur finally even though they are unaccented, 
e.g. Who's coming? 'Iam /em/; Who's got it? 'I have /hev/ . 

Similarly, prepositions, e.g. to, from, at, for, apart from having a 
strong form when receiving a primary (nuclear, tonic) accent, also have 
a qualitative prominence when final and unaccented, e.g. Where have 
they gone to? (/tu:/, also /tu/, but not /ta/); Where’s he come from? 
(/from/ rather than /fram/); What are you laughing for, at? (/fo:z, 
zt/). This applies, too, when prepositions and auxiliary verbs occur 
finally in a rhythmic group including at a ‘deletion site’ where the 
following item is understood, e.g. He ‘looked at /zt/ and ‘solved the 
'problem; or ‘people who can af ford /tu:/ (sc. do so) ‘buy ‘luxuries cf. 
‘People who can af ford /ta/ ‘buy ‘luxuries, do so. When a preposition 
occurs before an unaccented pronoun, either the strong or the weak 
form may be used for the preposition, e.g. I gave it to you (/ta/ or 
/tu:/); I’ve heard from him (/fram/ or /from/); / waited for you (/fa/ 
or /f9:/); I looked at her (/at/ or /xt/). 

Note, too, that certain form words, not normally possessing an 
alternative weak form for unaccented occurrences, may show such 
reductions in very rapid speech, e.g. J (/a/) don’t know; What's your 
(/39/) name?; I go by (/b3/) bus; Do you know my (/ma/) brother?; for 


! As a demonstrative adverb, there will be accented, e.g. there’s the book /eaz da 
buk/. 

2A weak form with /h/ would normally be used when unaccented but following a 
pause. 
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love nor (/n9/) money;' two or (/3/) three;! ever so (/s3/) many.' In the 
case of the disyllables any, many, a qualitative prominence may be 
retained on the first syllable under weak accent—/en1, ment/, but fully 
reduced, unaccented forms may be heard in rapid speech, e.g. Have any 
more come? /'heevni mo: 'kam/; How many do you want? /'hau mni 
dju: ‘wont/. Other monosyllabic form words normally retain their 
strong vowels in relatively unaccented positions, e.g. on, when, then, 
one, but again, in very rapid speech, reduced vowel forms may be 
heard, especially when the word is adjacent to a strongly accented 
syllable, e.g. What on (/an/ or /n/) earth!; When (/wan/) all’s said and 
done; Then (/ðən/) after a time; one (/wan/) always hopes. 

It may be said that the more rapid the delivery the greater the 
tendency to reduction and obscuration of unaccented words. Even 
monosyllabic content words may be reduced in rapid casual speech, if 
they occur in a relatively unaccented situation adjacent to a primary 
accent, and especially if they contain a short vowel, e.g. /1/, You sit 
over here /'ju: s(2)t Uva 'h19/; /U/, He put it there /‘hi: p(o)t it ,dea/; 
/a/, He’ll come back /'hi:l k(2)m beek/; /e/, Don’t get lost /'dount 
g(ə)t 'Iost/; less frequently with the more prominent short vowels /z, 
D/, e.g. /z/, They all sat down on the floor /de1 ‘9:1 sat 'daun an da 
flo:/; /v/, We want to go /'wi: want ta ,gau/; and, finally, the 
diphthong /au/, with its dominant central [a] element, is readily 
reducible to /a/ under weak accent, e.g. You can’t go with him /ju 
ka: nt ga wid 1m/; He's going to do it /‘hi:z gana ,du: 1t/. 


10.5 Intonation? 


The meaning of an English utterance, i.e. the information it conveys to 
a listener, derives not only from its changing sound pattern and the 
contrastive, accentual prominences already referred to, but also from 
associated variations of pitch. Such rises and falls in pitch level, or 
patterns of intonation, have two main functions:— 

(1) Accentual.—Intonation changes are the most efficient means of 
rendering prominent for a listener those parts of an utterance on which 
the speaker wishes to concentrate attention; pitch change is especially 
significant as a cue for signalling the word or words carrying primary 
(nuclear) accent. It should be remembered, however, that the accentual 
pattern of a response is often largely conditioned by constraints 
imposed by the context (see §10.2). 


| These weak forms seem to be restricted to a limited number of phrases and, in the case 
of or, particularly occur in linking two forms as in the example given here. 

2 For a more complete treatment of intonation in these terms, see J. D. O’Connor and 
G. F. Arnold, The Intonation of Colloquial English, 1973, Longman, and works referred 
to in the bibliography under Palmer, Kingdon, Schubiger, Jassem, Lee, Crystal, 
Cruttenden. 
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(2) Non-accentual,—In addition, intonation is used as a means for 
distinguishing different types of sentence, e.g. the same sequence of 
words may, with a falling intonation, be interpreted as a statement or, 
with a rising intonation, as a question.’ Moreover, a listener derives 
from a speaker’s intonation information as to the latter’s emotional 
attitude (to the listener or to the topic of a conversation) or personality, 
e.g. his intonation might reveal a patronizing attitude to the listener, an 
incredulous attitude to the topic or a querulous disposition. Further- 
more, intonation has a complex linking function in dialogue and within 
the speech of a single speaker. 

In so far as a listener interprets correctly those parts of an utterance 
upon which the speaker wishes to concentrate attention, or is aware of 
the speaker’s attitude to him, or makes judgments upon the personality 
of the speaker, the pattern of intonation used may be said to constitute 
a linguistic system which has a communicative function within a partic- 
ular community. There seems no doubt that intonation in its accentual 
function and in that non-accentual function concerned with the dis- 
tinction of sentence types, does constitute a linguistic reality of this 
kind. Since, however, we sometimes misinterpret the emotional atti- 
tude as conveyed by intonation, it may be said that non-accentual 
intonation patterns of this kind are less perfectly systematized, or that 
such linguistic systems are more numerous and applicable to smaller 
communities (regional or social) than phonological systems, so that a 
faulty judgment of emotional attitudes conveyed by intonation cues 
may derive from an interpretation of these cues in terms of our own, 
different, intonation usage in showing such attitudes (cf. the interpreta- 
tive adjustments needed on the phonological level between speakers of 
two different types of English). 


10.5.1 The Accentual or Emphatic Function of Intonation 


The various degrees of accentuation in an utterance may be signalled by 
means of intonation in the following way:— 


(a) Primary (nuclear) accent—by means of a change of pitch direc- 
tion initiated by the syllable receiving the accent (marked , , `, ,, 
E 

(b) Secondary (pitch prominent) accent—by means of a change of 
pitch level (higher or lower) on the accented syllable (marked '). 

(c) Secondary accent without pitch prominence*—secondary accent 
on some words may be manifested by qualitative, quantitative, 
or rhythmic prominence, without pitch prominence (marked ,). 


! It should be noted that this is not meant as a generalization about the intonation of 
questions: as such it would be an over-simplification. See $10.5.3. 
¿Such accentuation is regarded by some as stress rather than accent. 
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(d) Unaccented syllables—do not normally have pitch or other pro- 
minence and are unmarked (see, however, §10.5.4). 


10.5.1.1 Realization of Primary Accent 


The primary accent (or accents) in a sentence is shown by initiating a 
change of pitch direction, with the nucleus (carrying a falling or rising 
tone or a combination of the two) on the appropriate syllable of the 
word (or words) on which attention is particularly to be concentrated. 
The situation of the nucleus or nuclei is, therefore, of prime importance 
in conveying meaning, e.g. compare the meanings of the following 
sentence according to the shift of nucleus: — 


(a) ‘Jack likes ¡fish (1.e. not George, but Jack) Ne 
(b) ¡Jack ‘likes ¡fish (i.e. there is no question of his hating fish) 


e\o 


(c) ¡Jack ¡likes ‘fish (i.e. not meat or poultry, etc.) è e = 


or again, the meanings of (a) and (c) may be combined in 
Diack likes “fish, NY 5. >) 


In the sense that the nuclear syllable stands out from amongst its 
neighbours (both accented and unaccented syllables), the nucleus and 
its situation may be said to have a special contrastive function, a 
function of emphasizing a particular part of an utterance and thus 
focusing attention on it. But it may happen that special contrastive 
prominence is not attached to any of «the accented syllables of a 
sentence—this is more common in reading than in normal speech, in 
which it is usual for one or more words in a sentence to be ‘pointed’. 
Where no word is specially contrasted, the nuclear change of pitch 
direction will be initiated by the last accented syllable, e.g.:— 


By the 'time we ‘got to the ,house, we were all 'wet \through. 


In a more animated style, more typical of conversation, the same 
sentence might have nuclear pitch changes on several accented 
syllables, e.g.:— 


By the ‘time we ‘got to the “house, we were “all ¡wet ‘through. 


The way in which the change of pitch direction is realized depends on 
the type of change involved and on the nature and extent of the phono- 
logical sequence covered. 
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10.5.1.2 Types of Nuclear Tone 


(1) The falling nuclear tone (,, *).—The falling glide may start from 
the highest pitch of the speaking voice and fall to the lowest pitch (in the 
case of the high-fall), or from a mid pitch to the lowest pitch (in the case 
of the /ow-fall), or with variation of starting point according to the 
intonation context. The falling glide is most perceptible when it takes 
place on a Syllable containing a long vowel or diphthong or a voiced 
continuant (e.g. /m, n, D, l, z/, etc.), e.g.:— 


———— XA 


N A AA 


Wo. He 'couldn't be „seen. It was ‘raining. 





When a fall occurs on a syllable containing a short vowel with its limits 
formed by fortis, voiceless consonants (especially the stops /p, t, k/), 
the glide, particularly of a low-fall, is so rapid that it is not easily 
perceptible, or may be realized merely as a low level pitch in relationtoa 
preceding higher pitch, e.g.:— 


@ e e N 
‘What have you ‘got? 


oe ict << ‘es 
but ‘What have you got? or 


Again, when syllables follow the nucleus—the tai/—the fall may be 
realized as the juxtaposition of relatively high pitch on the nuclear 
syllable and low pitches on the syllables of the tail, e.g.:— 


Bio = 


It’ll ‘rain in a ¡minute 


e, 

0 
o + o. e 
ita are ee o 


but Pd for‘gotten or 
A e + . . o 
It was ‚yesterday or 


(2) The rising nuclear tone (,, ’).—In the same way, a rising glide, 
which may extend from low to mid, or from mid to high, or with other 
variations of starting and end points between low and high, is more 
easily perceptible when it occurs on a syllable containing a long vowel or 
diphthong or a voiced continuant consonant, e.g.:— 
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e) AS . e ae 
¿¡No. ‘Can you ,see? He's ¡not ill. 


When a low-rising glide occurs on a short syllable, it must necessarily be 
accomplished much more rapidly, or may merely consist of a relatively 
high level pitch in relation to a preceding low pitch, or even of a slightly 
lowered level pitch in relation to a preceding mid or high pitch, e.g.:— 


9 O a O o 
© a a 9 
v Al EEE IN 


| 
¡Can she ,cook? Can she ,cook? or 


With a tail, the rise is achieved by means of a relatively low pitch on the 
nuclear syllable with an ascending scale on the following syllables, 
e.g.:— 





o e 3 o . 
e e a 





Is it ,raining? ‘Are you ,comfortable? 


(3) The falling-rising nuclear tone ().—The fall and rise may be con- 
fined within one syllable, the glide beginning at about mid level and 
ending at the same level (or slightly above or below); in the case of a 
short syllable, the dip in pitch 1s made extremely rapidly and may be 
realized as an instant of ‘creaky’ voice or even of cessation of voice, 
e.g.:— 


2 | 


s Ay A y 
No. It’s “true. It’s shut. 


w 


When an unaccented tail follows the nuclear syllable, the fall occurs on 
the nuclear syllable and the rise is spread over the tail, e.g.:— 


a: e, 
It's ‘raining. It’s quite “comfortable. 
It’s ‘not a “caterpillar or 


When a secondary accent follows the primary (nuclear) accent, the fall 
takes place on the nuclear syllable and the rise 1s initiated on the syllable 
carrying the secondary accent, e.g.:— 


mor eee, ee ee , i ee 


He ‘didn’t “telephone. He's educated or 
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In a longer utterance, the rise begins on the last stressed syllable of the 
tail. What amounts to the same pattern may occur where, within the 
same group, a word containing a falling syllable is followed by a word 
containing a rising syllable, both syllables carrying a primary accent, 
e.g.:— 


AE A ARA 


YYou can ,see. “Come along. ¡Bring me a ,cup. 
A A... T 
e e © e e ə 
\Nothing’s im,possible or 


(4) Rising reinforcement of a fall ().—A fall may be reinforced by 
an introductory rise, especially on a long syllable containing voiced 
continuant consonants (which may be given extra length), e.g.:— 


. e 6N o o O . o 6N 
It was raining. He ,wasn’t a lone. 


A reinforced short syllable followed by a tail may be realized as a low 
accented nuclear syllable followed by a fall on the tail, e.g.:— 


= e VE ae 


You’d “better. It was “yesterday. 





When such reinforcement occurs on the falling part of a divided fall- 
rise (i.e. when the rise is situated on a later word), it has the effect of 
focusing attention on the falling nucleus, e.g.:— 


“You can ,see or 


o™ , e @ ” í $ ie o ° i 
“Nothing’s im,possible or 


(5) Ina running dialogue, more than half of the nuclear tones may be 
expected to be of the falling type, followed in frequency by the fall -rise 
or divided fall-rise, and finally the rises. The actual proportion of tone 
types will, of course, depend to a certain extent upon the style of the 
conversation and the emotional attitudes of the speakers. See $11.6. 
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10.5.1.3 Realization of Secondary Accent 


(1) Pre-nuclear.—Syllables preceding the nucleus may have pitch pro- 
minence (secondary accent, marked ') through being given a high level 
pitch when initial or a high level pitch in relation to preceding syllables, 
e.g.:— 





o ° 3 e. A a 


‘When do you ,want it? We waited ‘twenty \minutes. 





The first pre-nuclear accented syllable has been called the head (a ‘high 
head’ if given pitch prominence as above); any syllables occurring 
between the head and the nucleus are then said to constitute the body. 
Alternatively, many would define the head as the stretch of speech from 
the first accented syllable up to (but not including) the nucleus. The 
term body would then be discarded. Accented syllables within the body 
(or further syllables within the head) may be given pitch prominence by 
means of a step down in pitch, e.g.:— 


Te Se o... 
It ‘all 'went wrong. He didn’t understand a thing. 
o E o e a 


'Can you let me ‘have it to,morrow? 


Additional pitch prominence may also be given to an accented syllable 
in the body,! in a sequence of such accented syllables, by means of a step 
up (marked'), thus constituting in effect a supplementary head, e.g.:— 


: o n 
We 'came ‘home to ‘find "nobody «there 


3 e a 
'One of the 'two ‘lorries "suddenly be'gan to , skid. 


Such a pattern of pitch accentuation would have the effect of giving 
special prominence to nobody and suddenly. Alternative intonation 
patterns would divide these sentences into two groups:— 





o 2 e e 
Y) |... o 
We'came ,home to find ‘nobody ¡there 


lie. within the head in most terminology. 
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O e G A 
'One of the 'two ,lorries 'suddenly be'gan to ‚skid. 


Pre-nuclear syllables may also be accented without pitch pro- 
minence, i.e. they are accented only for reasons of rhythmical stress, 
quality, or quantity, or because of the content nature of the word 
(marked ,, whether high or low in pitch). They may be the only accented 
syllables before the nucleus and be said on a relatively low level pitch (a 
‘low head’), e.g.:— 


@® > AU) O s @ . Y 
It ¡can't be ,helped. Put it ¡over ‘there. 


9 ° 0 o o eS J 
It ¡wasn't ¡what I ex pected. 





If such syllables precede a high head, they are relatively low and level, 
e.g.: — 


a 
$ o . e 


o € a e 
. lo . 
He arrived just as we were ,leaving. 


en 242 SO o 
Can you be ‘quite ,sure? 


(the high head on just and quite giving these words special prominence). 
If such syllables occur within the body (or head), they are kept on 
about the same pitch as the preceding pitch-prominent syllable, e.g.:— 


o ® . @ . o A 
And ‘that’s ¡what we ,want to find ‘out. 


e) 


‘Will you be coming to ¡see us ,off? 





(2) Post-nuclear.— After a falling nucleus, a secondary accent is 
manifested by rhythmic, qualitative, or quantitative prominence, the 
pitch remaining low (marked ,), e.g.:— 


A A E 


It was a mew hat. It “can be done: 


After a rising nucleus, however, syllables carrying secondary accent 
(also marked ,) continue on a rising pitch, e.g.:— 
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'Did you ,see him yesterday?  ,I didn’t ¡think it was funny. 


When final, they may have a certain additional pitch prominence, 
e.g.:— 


o 
o o 0 f 0 al 


‘Does he „like ¡dogs? "Did you ‘see how ,pale he looked? 


cf. 'Have you 'put the ,kettle on? and'Have you ‘got the ,catalogue? 
In the case of a falling-rising nucleus (whether on a single word where 
a secondary accent falls on the final syllable or in cases where the rise 


takes place on a following word), the pitch prominence (consisting of a 
rise) is very considerable, e.g.:— 


a). a ars | ee ee 


I haven't ¡much “appetite. It ‘wasn’t ¡very appetizing. 
wa) eae ai ek oe Ey 
I’ve ‘got the catalogue cf. I’ve ‘put the ‘kettle ,on. 


(3) Secondary accent between fall and rise.—Syllables carrying 
secondary accent, without pitch prominence, may occur between a fall 
and a rise; in this case, the unaccented syllables and those carrying 
secondary accent have a relatively low level pitch, e.g.:— 


O ee, 


“That was quite ,good. 





NE E A 


‘All of us were sur prised to ¡hear that you'd ,gone. 


10.5.1.4 Realization of Unaccented Syllables 


Unaccented syllables, in addition to the fact that they are said very 
rapidly and usually undergo some obscuration of their quality, do not 
normally have any pitch prominence. They may occur before the head, 
before the nucleus, or after the nucleus. 


(1) Pre-nuclear. —Unaccented syllables occurring before a nucleus 
(without a head), like syllables carrying secondary accent without pitch 
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prominence, are normally relatively low, whether the nucleus is a fall or 
a rise, €.g.:— 


ae AS O E NA 


It*s Mot. There were ‘two of them or 
e a . ° o o y . ð e . Py 
And ,then, —. There were e,leven. And ‚when we a,rrived, —. 


Unaccented syllables before a high head are usually said on a relatively 
low pitch, the head having contrastive prominence in relation to them, 
e.g.:— 


@ e e 
e A a . ° © a 
He's ‘always Jate. It was ‘after ,dinner, —. 


If pre-nuclear unaccented syllables, their weak quality remaining, are 
said on a relatively high pitch (marked `), the effect is more emphatic 
and animated than if they are low in pitch; the utterance may have a 
specially bright, lively, encouraging character before a rise, or an indig- 
nant, quarrelsome note before a fall,' e.g.:— 





a) © T 
(lively) lt is. “There were ,nine of us. 

AN eee 
(indignant) It ‘is. “There were ‘nine of us. 


(2) After first accented syllable.—Within the body, unaccented 
syllables remain on almost the same pitch as the preceding accented 
syllable, e.g.:— 


Lo] o o o s © o o f 
'All of us ‘wanted to ‚help him. ‘Put it ‘on the „table. 
eee 
e o e 
0 o e 
€ . . 


A — _—_ _—Q____ ŘS 


‘Will you be ‘able to ‘come to,morrow? 


If the unaccented syllables are on a higher pitch than the preceding 
accented syllables, a special impression of liveliness, eagerness, impa- 
tience, or encouragement Is again produced, e.g.:— 


! But over-generalization is to be avoided. 
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o 





a m . m 
(annoyance) 'Mary was always late. 
hd . e e o, 
(encouragement) 'Come and sit by me. 
o j q J 
(eagerness) 'Are we ‘nearly ,there? 


It is to be noted that in these cases the accented syllables within the body 
receive extra relative pitch-prominence, the pattern being equivalent to 
a series of rising nuclei. 


(3) Post-nuclear. —Unaccented syllables following a falling nucleus 
remain on a low level, e.g.:— 


AAA ue ee ee E 


He’s a ‘bachelor. There are administrative difficulties. 





After arising nucleus, unaccented syllables continue (or effect) the rise 
(the last syllable of all having a short rising glide sometimes, which gives 
it an extra prominence without contrastive significance), C.8.:— 


@ . 0 r E 
o ° A e . 2 
'Put it 'on the ,table. It’s ¡not im,possible. 
o ° " 
ə . @ e . 


Has he ¡got a so/licitor? 


The rise of a falling-rising nucleus may be spead over the following 
unaccented syllables, e.g.:— 


: a. * 
He ‘doesn’t ‘like “criticism. 
Between a fall and a rise, unaccented syllables remain relatively low, 
€.g.:— 


ui e e E, 


\Yesterday was ,fine. 


10.5.2 The Grammatical and Attitudinal Functions of Intonation 


As was pointed out in §10.5, intonation, in addition to its function of 
providing a means of accentuation (or emphatic focus), may also serve 
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to distinguish sentence types (e.g. statement and question) and to 
indicate the emotional attitude of the speaker. Such functions apply 
equally to utterances consisting of more than one word and to those ofa 
single word (of one or more syllables). In these cases, it is not so much 
the situation of the nucleus which is of importance, but rather the type 
of nuclear tone employed, i.e. whether a fall, rise-fall, rise, or fall-riseis 
used. Thus, a statement form of words may be made into a question ifa 
rise is used instead of a fall, e.g. ‘He’s ‘not there” (statement) v. “He's 
not ‘there?’ (question); or, in such a case as that quoted by H. E. 
Palmer,' the sentence ‘He doesn’t lend his books to anybody’ may have 
two meanings according to whether anybody is said with a falling 
nucleus (i.e. he lends them to nobody) or with a falling-rising nucleus 
(i.e. he does lend them to some people). This semantic function of 
intonation (typified by the last example) occurs less frequently than that 
which shows the speaker’s emotional attitude: compare, for instance, 
‘It’s ‘all right” (a plain statement of fact), ‘It’s ‘all ,right’ (an encour- 
agement), and ‘It’s ¡all ,right’ (a grudging or petulant agreement). In 
this connection, in addition to the type of nuclear tone used, the pitch of 
that part of the utterance preceding the nucleus (the head) is also of 
great importance (and sometimes the pitch of unaccented syllables— 
see §10.5.1.4(1)). 

In the examples given in the following sections, an attempt is made to 
assign generalized verbal descriptions to the attitudes conveyed by 
intonation patterns in respect of various types of utterance; the main 
types of utterance are: — 


(a) assertions 

(b) questions containing an interrogative word (X-questions, also 
called wh-questions or open questions) 

(c) questions expecting ‘Yes’ or ‘No’ as an answer (Yes/No ques- 
tions, also called closed questions) 

(d) question tags 

(e) commands, requests, etc. 

(fJ) exclamations, greetings, etc. 


The examples are given as isolated utterances, but it should be 
remembered that the attitudinal meaning of an utterance must always 
be interpreted within a context, both of the situation (see also §10.5.3 
below) and also of the speaker’s personality. It may well happen that an 
intonation which is neutral in one set of circumstances might be, for 
instance, offensive or patronizing when used by another person or in 
other circumstances. 


! See H. E. Palmer, English Intonation with systematic exercises, Heffer, 1922; also 
W. R. Lee, An English Intonation Reader, Macmillan, 1960. 
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10.5.2.1 Falling Nuclear Tones 


A falling nuclear tone, whether , or `, is generally matter-of-fact, 
separative, and assertive, the higher the fall the more vigorous the 
degree of finality implied. No explicit appeal is made to the listener, yet 
the pattern is not necessarily impolite; a conversation amongst people 
who are intimately acquainted might, for instance, exhibit a preponder- 
ance of falling intonations, without the exchange being querulous or 
lacking the social courtesies of speech. 


(1) Low-falling nuclear tone 


Pattern—nucleus (tail) 

(a) \No. ¡Yesterday (detached, unexcited) 

(b) \When? (curt) How are you going to ¡do it? (weak insistence on 
how) 

(c) ¡Are you coming?' Does he want to?! (curt, impatient, testy) 

(d) (He \does,) doesn’t he? (\Lovely,) \isn’t it? (calmly presupposing 
agreement) 

(e) ¡Count them. ¡Get it then (calm, detached, peremptory) 

($) Tragic! (quietly sympathetic; or distant and unmoved, 
especially if the fall does not reach the lowest level); ¡Morning 
(perfunctory) 


Pattern—high head, nucleus (tail), the high pre-nuclear pattern 
showing a degree of lively interest. 

(a) The ‘parcel arrived on Thursday (matter-of-fact, but interested) 

(b) "What do you 'want to do? (blunt to strangers, but a common 
unemotional form amongst intimates) 

(c) 'Are you going? (peremptory, impatient) 

(e) ‘Put it ‘over there (polite, neutral) 

(f) "What a mess (phlegmatic, mild) 
'C) Good \morning (the high pitch on good, whether it is an 
accented head or an unaccented high pre-nuclear syllable, makes 
the greeting pleasant, though routine) 


Pattern—low head, nucleus (tail); the low pre-nuclear pattern throws 
the nucleus into greater prominence and often shows lack of interest. 
(a) It’s ¡all we could expect (surly, uninterested) 
(b) ¡What are we going to do? (resigned, bored) 
(c) ¡Have you ¡got the tickets? (uninvolved, perfunctory) 
(e) ¡Leave it ¡on the \table (preoccupied, expecting to be obeyed as a 
matter of course) 


1 The insistence or curtness here is caused partly by the placement of the nucleus on the 
auxiliary verb. It should be noted that nucleus placement and choice of nuclear tone are 
separable variables. 
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(J) How a\nnoying (bored, unconcerned, sarcastic) 
¡Good \morning (routine, perfunctory greeting) 


(2) High-falling nuclear tone.—Animated; may be strongly contras- 
tive or contradictory; often showing strong indignation or excitement; 
very common in ordinary colloquial speech. 


Pattern.—(pre-head), nucleus (tail). 

(a) ‘No. It ‘was. Of ‘course it ,isn’t (vigorous agreement or con- 
tradiction) | 

(b) “Why? ‘How ¡can she? (surprise, indignation, incredulity) 

(c) Do ‘you ¡like it? You’re ‘sure? (insistence, demands as answer) 

(d) (She ‘doesn’t), ‘does she? (He ‘does), ‘doesn’t he? (demands 
agreement) 

(e) ‘Drop it. ‘Cancel it then (energetic, angry command) 

($) ‘What a surprise! (strong surprise, indignation) 
“Good morning (hearty greeting) 


Pattern.—low head, nucleus (tail); a high-falling nuclear tone 
usually depends for its effect on a relatively low pattern of any preced- 
ing syllables, all the accentual force being concentrated in the nucleus; 
the following sentences said with high heads (and bodies) would have 
less contrast associated with the nuclear word. 

(a) ¡Not at ‘all. I quite a‘gree. 

(b) Why do you ‘want it? 

(c) ,Can we ,have it ‘now? 

(e) ¡Go and ‘find it. Í 

($) What a stupid mistake! 

¡Good ‘morning (a bright, cheerful greeting) 


(3) Rising-falling nuclear tone.—The initial rise may reinforce the 
meaning of any high fall, often with additional warmth, indignation, 
sarcasm, etc. A preceding head will usually be relatively low. 


Pattern—(low head) nucleus (tail). 

(a) “Yes. It was a” palling. A,bout “time (all may show enthusiastic 
agreement or enthusiasm; but “Yes”, said slowly, expresses 
doubt, /“pa:1/, prolonged, shows horror, and ‘ time’ may 
reveal sarcasm or indignation) 

(b) ¡What does his ° father ,do? (suspicious, indignant interest) 

(c) ¡Can you be “sure (mocking, knowing, suspicious) 

(d) “Is she? “Doesn't he? (incredulous, mocking, indignant) 

(e) ¡Get a nother one (pert, haughty, sulky, indignant) 

(f) ,Oh, indeed. ¡How “nice for you (sarcastic) 

¡Good ° morning (portentous, ironical greeting) 


Connected Speech 283 


10.5.2.2 Rising Nuclear Tones 


(1) Low-rising nuclear tone.—Essentially unfinished and continuative, 
often with overtones of politeness, encouragement, pleading, diffi- 
dence, suspicion, etc. 


Pattern—nucleus (tail). 

(a) ,No. ,Possibly (tentative, grudging, encouraging). Note also the 
introductory usage: ‘,Then,—,’ and adverbially after a main 
clause: ‘(We were aMone), luckily.’ 

(b) ¡How did you doit? (insistence on ‘How’—the lower the starting 
point of the rise, the greater the insistence) 

(c) ¡Are they coming? (insistence on ‘Are’) 

¡Is he? (doubtful, indifferent) 

(d) (He’s ‘got one), ,hasn’t he? (It isn’t ‘there), ,is it? (doubtful, 
asking for information) 

(e) ,Wait. ¡Hold it (gentle command or request) 

(f) ¡Well (introducing a topic, or an uninterested question form) 


Pattern—high head, nucleus (tail); a relatively high pre-nuclear 
pattern gives an effect of a fresh thought, lively interest, appeal, 
encouragement, etc. 

(a) It’s ‘all right. It 'doesn't ,matter. She 'won't be ,long (reassuring 
statements). Also dependent clauses occurring before the main 
clause, e.g. “When we a,rrived, (they’d al'ready gone); follow- 
ing a main clause containing a falling nucleus, the pre-nuclear 
pattern of this dependent clause is likely to be low, e.g. ‘(They’d 
al'ready gone), ¡when we a,rrived’. 

(b) "What's the ,time? (polite inquiry) 

(c) 'Can you ,come? (polite, interested) 

(e) ‘Sit ,down. ‘Come over ,here (pleasant, encouraging invitation) 

(f) 'All'the ,best. 'Good ,luck (cheerful good wishes). ‘Good ,morn- 
ing (cheerful, friendly greeting) 


Pattern—low head, nucleus (tail); a relatively low pre-nuclear 
pattern often signifies complaint, suspicion, veiled threat, lack of 
interest or enthusiasm, etc. 

(a) It’s ¡all ,right. It’s ¡not 1m,portant (resigned, disgruntled, long- 
suffering, bored). Also in dependent clauses, before or after the 
main clause, e.g. “If ,JJack’s ,home, (we’ll come at ‘eight)’; 
‘(We’ll come at ‘eight), if Jack*s ,home’; in the first case, ‘Jack’ 
has not the contrastive accentual force which would be asso- 
ciated with “Jack”. 

(b) What have you been ,doing? (unsympathetic, menacing, 
threatening) 

¡What's your ,¡name? (peremptory, routine cross-examination) 

(c) ¡Can you come ,next week? (uninterested, disgruntled, bored) 
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(e) ¡Try a,gain (routine request, peremptory) 
‘Don’t leave the ,door ,open (long-suffering, complaining) 
($) ¡Good ,morning (polite but perfunctory greeting) 


(2) High-rising nuclear tone.—This nuclear tone, rising to a high 
pitch, is associated essentially with questions, e.g.:— 


(i) An elliptical question (showing eagerness, brightness, enthu- 
siasm, or asking for a repetition):— 

‘Coffee? (= ‘Will you have some more coffee?’ or ‘Did you say 

coffee?’) i 

‘Like it? (= ‘Do you like it?’) 

“Monday? (= ‘What about Monday?’ or ‘Did you say Monday?’) 

‘John? (= ‘I’m calling you—are you there?’ or ‘Is that you, John?’ 

or ‘Did you say ‘‘John’’?’) 

‘When? (= ‘Would you repeat what you said?’) 


(11) A question showing great eagerness, excitement, concern, 
indignation, etc.:— 

It ‘is? "You “did? (surprise, incredulity) 

¡Can you ‘come? (eager expectancy) 

You ,actually ‘saw him? ¡What “me? (indignation, surprise, horror) 

¡Can we a’fford it? (concern, expectancy, apprehension) 

It ‘wasn’t ‘yours! (dismay, surprise, indignation) 


10.5.2.3 Falling-Rising Nuclear Tones 


The falling-rising nuclear tone combines the dominant effect of the fall 
(contradictory, contrastive) with any of the emotional or meaningful 
attitudes (not expressed verbally) associated with the rise. Both fall and 
rise may occur within one word, e.g.:— 


Pattern—(accented or unaccented pre-nuclear syllables), nuclear 
word (tail). 
(a) “No (doubtful or encouraging) 
He could (forcefully reproachful) 
“Sometimes (encouraging) 
I ‘haven’t ‘much “appetite (‘but I’ll join you to be polite’; if 
the pre-nuclear pattern is low, the effect is less agreeable— 
disgruntled) 
Pm waiting (‘so do hurry up’) 
“Lately, (it’s been ¡too ‘wet) (lively, introductory, con- 
tinuative) 
If he could, (we*d be delighted) (forceful dependent clause) 
(b) “When? How? (forceful, encouraging, prompting) 
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(e), (f) “John! “Look! (an appealing, inviting summons) 
Gently! (encouraging, soothing, warning) 


or the fall and the rise may occur on different, accented, words, e.g.:— 


Pattern—(accented or unaccented pre-nuclear syllables), word con- 
taining fall (intervening syllables, with or without accent), word con- 
taining rise (tail). 

(a) ‘He ,could (‘but I doubt whether you could’) 

It wasn’t ‘your ,fault (reasoning, sympathetic) 

‘That’s the way to ,do it (encouraging, pleasant) 

(He’ll ‘have to), ‘one ,day (insistence on one, followed by the rise 
of the dependent clause) 

On ‘Saturday ,morning, (we’ll be going ‘out) (contrastive fall on 
Saturday followed by continuative, introductory rise) 

(b) “When can you ,come? (a polite questioning rise, preceded by a 
fall on when to insist on the precise time) 

(c) Will ‘you be ¡coming on ,Sunday? (a polite questioning rise, 
preceded by a fall on you to focus aitention on your intended 
action) 

‘Can ,you? (insistence on your ability to do something associated 
with the polite questioning rise) 

(e) ‘Do ¡sit ,down. ‘See if you ,can (pressing request) 

“Mind ‚how you ,go (strong but sympathetic warning) 

(D) ‘Well ¡done! (warm, sympathetically appreciative) 


10.5.2.4 Multi-nuclear Patterns 


It often happens that, in addition to the composite nuclear patterns 
consisting of a fall + a rise, a number of nuclei occur in a single 
syntactic group. Such a series of nuclear syllables, often occurring 
rhythmically at equal intervals of time, may produce an effect of a cate- 
goric, downright, hectoring, insistent, self-assertive way of speaking. 
Nevertheless, such multi-nuclear patterns are extremely common in 
ordinary conversation and often serve no other purpose than to pro- 
duce a lively, animated effect and to focus attention on the important 
words of an utterance, e.g.:— 


(1) A series of falling nuclear tones preceding a fall. —An emphatic 
equivalent of a head (+ body) + falling nucleus. 
(a) We ‘never ‘thought he ‘had a ‘chance 
It was ‘all we could ‘do to ‘stop him 
(b) ‘What do you ‘think we can ‘do? 
(c) ‘Can she ‘do the ‘work? 
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(e) ‘Don’t be ‘such a ‘fool 
“Go and ‘try again. 
(f) ‘What an ‘awful ‘mess. ¡Good ‘after‘noon 


(2) A series of falling nuclear tones preceding a rise. —An emphatic 
equivalent of a head (+ body) + rising nucleus (or fall-rise). 
(a) I ‘always ‘thought he ,would 
I was ‘most impressed that ‘they should ,want to 
It ‘looks as ‘black as ‘night ¡out' side 
(b) ‘How many ‘seats were you ‘able to ,get? 
(c) Are you ‘sure that ‘George and ‘Mary ¿know? 
(e) ‘Mind you ‘put your ‘hat and ‘coat ,on 
“Come and ‘see us ‘when you can 
($) ‘What a ‘very ‘silly ¡little ,boy! 


(3) A series of rising nuclear tones preceding a fall.—An emphatic, 
impatient, or patronizing equivalent of a head (+ body) + falling 
nucleus. 

(a) And ¡now you ,both must ,go to ‘bed (decisive, but humouring) 
It’s ,only ,in your ,own ‘interests (strongly persuasive, somewhat 
impatient) 

(b) ¡What on ,earth do you ,think you're ‘doing? 

(c) ¡Can you ex,pect them to ,do it alone? 

(e) ¡Don't be ,such a ,silly ‘fool! 

(The last three examples showing impatience, exasperation.) 


(4) A series of rising nuclear tones preceding a rise.—A repetitive 
indication of the attitudes associated with a rising nuclear tone (appeal, 
complaint, sarcasm, etc.), being the equivalent of a head (+ body) + 
low rise. 

(a) You ¡know we ,only ,wanted to ,help (patronizing, grudging) 

(b) ¡When were you ,thinking of ,paying it ,back? (patronizing, 

Sarcastic) 

(c) ¡Will you be ,coming to ,see us on ,Monday? (apprehensive, 

unenthusiastic) 

(e) ¡Don't make ,such a ,noise! (quietly pleading, the appeal being 

stronger as a result of the succession of rises) 

The same sentences said with a series of higher rises would give an 
effect of greater animation, excitement, urgency, etc. 


10.5.2.5 Other Devices for Signalling Attitudes 


Many other devices exist for expressing in sound the mood of a speaker 
in addition to the actual words which he uses. A rapid rate of delivery, 
for instance, may express irritation or urgency, whereas a slower rate 
may show hesitancy, doubt, or boredom in statements, or sympathy or 
encouragement in questions and commands; a repeated alveolar click 
denotes discontent; an egressive voiceless air-stream with friction at the 
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rounded lips and a falling pitch (produced by the adjustment of the 
mouth cavity by movement of the tongue) expresses surprise, admira- 
tion, relief, whereas an ingressive air-stream of the same type may, in 
addition, be used to signify pain or pleasurable anticipation; and the 
utterance may be punctuated by sighs, denoting boredom, impatience, 
or sorrow. The intonation of a sentence, however, may indicate 
attitudes of the speaker other than those associated with the type of 
nucleus used or the choice of a relatively low or high pre-nuclear pattern 
of syllables: — 


(1) Range of intonation.—The degree of enthusiasm, liveliness, 
interest, in those tunes where the pre-nuclear pattern is relatively high 
will be greater the higher the pitch of the head, e.g. with an extra-high 
pitch on the syllable italicized in the following examples and a con- 
sequent greater pitch separation of any accented syllables between the 
head and nucleus:— 


(a) It’s ‘awfully „good 
Everyone 'thought it was marvellous 
It’s ‘not as bad as you ,think 

(b) 'What’s the ,time? 

(c) ‘Are you \sure? 

(e) 'Look at that 

($) Well 1 ¡never 
'Good 'after,noon 


(2) Continuous level pitch.—In certain situations (excluding the 
monotonous chant of prayers sometimes heard in church), a con- 
tinuous level pitch may be used to express acurt, detached, preoccupied 
attitude, which may give the impression of overbearing superiority if 
the pitch is relatively high, or of surly impatience if the pitch is relatively 
low:— 


iThat’ll be ,all. It’s not a bit of ¡good 

¡What do you want? ¡Are you coming? 

¿Come in. ¡How do you ,do? 

(The accented syllables are here marked with ,, since, whether they 
are high or low, they do not have pitch prominence.) 


10.5.3 The Discourse Function of Intonation! 


As was pointed out in §10.5, intonation has a complex linking function 
in dialogue and within the extended connected speech of a single 
speaker. To some extent this function may be seen as overlapping with 
the grammatical function, differing syntactic structures which consist 


l This is too large a topic to go into in detail here. Much research is currently being 
carried out in the field: see entries in the bibliography under Brazil, Couper-Kuhlen, 
Gussenhoven, House, Ladd. 
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of the same sequence of words tending to display different intonation 
patterns, e.g. “He re‘signed ,sadly’ (= the speaker is sorry to report 
someone else’s resignation) v. ‘He resigned ‘sadly’ (= the man who 
resigned was sad to do so). However, varying intonation is a matter not 
only of syntax but also of pragmatics which is concerned with the inter- 
pretation of utterances. So, for instance, it may be true on occasion that 
‘He’s ‘finished’ (with a falling nuclear tone) is a statement whilst ‘He’s 
/finished’ (with a rising nuclear tone) has the force of a question. 
However, in some particular conversational context, the version witha 
rising nuclear tone might be a defensive statement: ‘He’s ,finished’ (so 
why are you complaining?). 


10.6 Advice to Foreign Learners 


Of all the features of accent in connected speech, the foreign learner 
should pay particular attention to:— 

(1) The choice of words upon which the speaker’s stress and accent 
(primary and secondary) fall and the situation of the nucleus. This 
placing of the accent in connected speech will usually be determined by 
the meaning which is to be conveyed, subject to the constraints imposed 
by the context and total situation. 

(2) The rhythmic nature of connected English. The accented 
syllables follow each other at roughly equal intervals of time; any inter- 
vening unaccented syllables are said rapidly, the greater the number of 
such syllables the more rapidly they are uttered; unaccented syllables 
are associated with the hub of an accented syllable, so that an utterance 
is divisible into a series of rhythmic groups, themselves closely related 
usually to syntactic groups. 

(3) The related weakening of unaccented words, e.g. the appropriate 
use of weak forms. A strong form in what is intended to be an 
unaccented position will give an impression of accent to the English ear. 
Thus, He was late, pronounced /hi: woz ‘leit/ instead of /h1 (or [hi)) 
waz ‘leit/, gives undue prominence to was and may appear to contradict 
some such statement as He wasn’t late. 

He should also be aware of the attitudinal implications of pitch 
variation. Frequently, the tunes used in English to distinguish sentence 
types will, in their broad pattern, be similar to those of his own lan- 
guage; but he should note, for instance, the high frequency of occur- 
rence among English RP speakers of falling-rising tunes (often withina 
single word), which are less commonly encountered in other languages. 
He should be acquainted, too, with the English usage of falls and rises 
to signify the mood of the speaker, so that any over-use of rises will not 
give an unintentional impression of, for example, diffidence or 
complaint, and too many falls create an unwitting effect of impolite 
assertiveness. 


11 
The Word in Connected Speech 


11.1 Every utterance is a continuous, changing pattern of sound 
quality with associated (prosodic) features of quantity, pitch, and 
stress. The word (consisting of one or several morphemes) is, like the 
phoneme, an abstraction from this continuum and must be expected to 
be realized in phonetically different ways according to the context, cf. 
the various allophonic (phonetic) realizations of the abstract unit 
known as the phoneme. The word constitutes, however, a separable 
linguistic reality for the speaker. Whether it has a simple or a complex 
morphemic structure, it is an element of language which is commutable 
in an utterance with other members of its class, i.e. nouns for other 
nouns, verbs for other verbs, etc. It is, moreover, often capable of 
constituting an utterance by itself. It must, therefore, be considered as 
an abstraction ona higher level than the phoneme, its separable identity 
having been recognized in the sophisticated written form of English by 
the use of spaces between words. If however, the word is admitted as an 
abstracted linguistic unit, it is important to note the differences which 
may exist between its concrete realization when said (often artificially) 
in isolation and those which it has when, in connected speech, it is 
subject to the pressures of its sound environment or of the accentual or 
rhythmic group of which it forms part. Those word forms which are 
typical of connected speech are often known as special context forms. 
The variations involved may affect the word as a whole, e.g. weak 
forms in an unaccented situation or word accentual patterns within the 
larger rhythmic pattern of the complete utterance; or may affect more 
particularly the sounds used at word boundaries, such changes 
involving a consideration of the features of morpheme and word 
junctures, junctural assimilations, elision, and liaison forms. In 
addition, it will be seen that the extent of variation depends largely 
upon the speed of utterance, the slower or more careful and formal the 
delivery the greater the tendency to preserve a form nearer to that of the 
isolate word. 
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11.1 Neutralization of Weak Forms 


We have seen already (§10.4(2)) that a number of form words may have 
different pronunciations according to whether they are accented (or 
said in isolation) or, more typically, are unaccented. Such is the reduc- 
tion and obscuration of the unaccented forms that words which are 
phonetically and phonemically separate when said in isolation may be 
neutralized under weak accent. Such neutralization causes no con- 
fusion because of the high rate of redundancy of meaningful cues in 
English; it is only rarely that the context will allow a variety of inter- 
pretation for any one cue supplied by an unaccented word form. The 
examples of neutralization which follow might occur in rapid, familiar 
RP:— 


/9/ = unaccented are, a (and, less commonly, her, or, of) 

The ‘plays are ¡poor 

He ‘plays a poor man 

She 'wants a dog 

She 'wants her \dog (less rapidly, with reduced /3:/ for 
her) 

‘One or 'two of them are coming (or /9:/ for or, /əv/ for 
of) 

‘Two ‘books are mine 

Two ‘books of \mine (or, less rapidly, /av/ for of) 


/av/ = unaccented have (aux.), of 
Some of one, .. . 
‘Some have ,won,.. . 
The ‘boys of Eton fish 
The 'boys have ‘eaten \fish 
(The last two utterances being identical, the meaning is clear 
only from the larger context.) 


/or/ = unaccented are, or 
'Ten or under (less rapidly, /9:/ for or) 
'Ten are \under 


/03/ = unaccented the, there 
There 'seems a ‚chance 
The ‘seams are crooked 


/s/ = unaccented is, has, does 
What's Cs = does or is) he Jike? 
What's Cs = has) he Jost? 


/z/ = unaccented is, has, does 
Where's (s = has, less commonly does) he ¡put it? 
Where's Cs = is) he going? 
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/az/ = unaccented as, has 
J | 
How much das he \done? 
| 
As much as he ¡can 


/an/ = unaccented and, an 
‘On and off 
'On an \off-,chance 


/n/ = unaccented and, not 
'Didn’t he ,do it? /‘didni:/ 
He ‘did and he didn’t /hi ‘did n i: \didnt/ 


/d/ = unaccented had, would 
Pd (Cd = had, would) ‘put it \here 


11.2 Variation of the Word’s Accentual Pattern; Stress Shift 


The accentual (rhythmic) pattern of a word generally remains constant 
whatever the environment, retaining its rhythmic identity in the total 
rhythmic grouping of the longer utterance (§§10.3, 10.4). Although a 
word may lose, in connected speech, the nuclear pitch change which it 
has in isolation, the relation of primary and secondary accents is not 
changed, e.g.:— 


behind; ¡get behind me; behind the book case 

wind screen; “wind screen ,wiper; the 'wind screen was ‘smashed; 
he bought a new “wind screen 

\yesterday; I ¡saw him ‘yesterday; ‘yesterday ‘morning 

Most ,office; 'post ¡office clerk; ‘near the ‘post ¡office 


But it happens that when a word (simple or compound) pattern 
consists in isolation of a primary accent preceded by a secondary 
accent, e.g. e Y» o Y +» o » Y + ctc., the primary accent 
may be thrown back to the syllable carrying secondary stress in 
isolation, if, in connected speech, a strong accent follows closely, 
e.g.:— 


'thir,\teen, but thir teen \shillings 
"Westminster, but 'West,minster ‘Abbey 
fullgrown, but a full grown ‘man 
afternoon, but ‘after noon ‘tea 


It will be noted, too, that, when a strongly accented syllable closely 
precedes, the potential pitch-prominent secondary accent may be 
reduced to one of quality, quantity, or rhythm, without pitch- 
prominence, e.g.:— 


‘eight „thirteen; ‘near ,West‘minster; ‘not ,full‘grown; ‘Friday 
¡afternoon 
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Moreover, when the primary accent is shifted back, in the case of a 
strong accent following, the secondary accent which falls on the syllable 
having primary accent in isolation frequently has no pitch-prominence, 
and may, if the quality of the syllable permits, receive no accentual 
prominence of any kind, e.g. 'West,minster ‘Abbey or ‘Westminster 
‘Abbey. 

Although because the primary accent may be regarded as being 
‘shifted back’ this process is often referred to as stress shift, it might be 
more accurate to say that the primary accent of the first word is lost 
whilst the secondary accent (on the earlier syllable) remains. Syllables 
capable of retaining such secondary accent are exemplified in §9.3 with 
the representation *. It may already have been seen from those exam- 
ples that English tends to the alternation of accented and unaccented 
syllables. 

It is in order to avoid the placing of primary accents on adjacent 
syllables that stress shift occurs in phrases such as ‘Chinese ‘restaurant 
(but Chinese), outside “world (but ‘out\side), aquamarine ‘necklace. 
Cf. aquama'rine tiara, diplo'matic ‘incident, where the stress-shifted 
forms are optional (aquamarine tiara, ‘diplomatic incident), the 
accents not occurring on adjacent syllables in either case. 

It may be added that this tendency to the alternation of accented and 
unaccented syllables is so strong that the stress may be shifted in the 
case of certain words whose citation form contains only one, later, 
stress, e.g. ornate but ornate ‘carvings; unique but ‘unique features; 
and direct but ‘direct ‘access (this latter, in those idiolects which pro- 
nounce /do‘rekt/, necessitating a change of vowel phoneme /'‘dairekt 
\eekses/). The alternation tendency extends into longer utterances and 
may be seen in examples such as ¡dea but the ‘idea ‘pleases me; 
recommend but I can ‘recommend ‘several; and in phrasal verbs such 
as ‘come ‘out, ‘get Vin, e.g. The ‘pictures ‘didn’t come ‘out, but They 
‘came out ‘well; and ' What 'time will ‘you get Vin? but 'What ‘time will 
you ‘get in from work? 


11.3 Phonetic Variations at Word or Morpheme Boundaries 


We have seen (Chapter 5) that our basic linguistic units (the phonemes) 
are derived by a process of abstraction from connected speech. If the 
phoneme /t/ is given a convenient, generalized label—a fortis voiceless 
alveolar plosive—it is nevertheless true that the actual phonetic 
realization of this consonant depends on the nature of the context, e.g. 
/t/ is aspirated when associated with accent (except after /s/), and 
dental, rather than alveolar, when adjacent to /8/ or /0/. Phonetically, 
we are dealing with a sound and articulatory continuum rather than 
with discrete units: features of sound segment A may be found in a 
following segment B, and features of B in A (cf. ‘transitions’ of 


The Word in Connected Speech 293 


consonants, §8.2.2). If, therefore, the utterance is analysed in terms 
of a sequence of phonemes, account must be taken of the phonetic 
continuity and merging of qualities by describing the mutual influence 
which contiguous elements exert upon each other; in other words, 
tendencies towards ‘assimilation’ have to be noted. The tendencies 
outlined below are valid for colloquial English RP but may not be 
characteristic of other languages. 

Variations of articulation may be of an allophonic kind, either within 
a word or at word boundaries; or, at word and morpheme boundaries, 
they may be of such an extent that a change of phoneme is involved, 
as between the pronunciation of a word in isolation and that which it 
may have in context. The fact that the phonemic pattern of a word is 
subject to variation emphasizes the potential nature of phonemic 
oppositions. The meaning of a word derives as much from the situation 
and context in which it occurs as from its precise phonemic shape, 
the high redundancy of English tolerating considerable confusion at 
the phonemic level. The mutual influence of contiguous phonemes in 
English may function predominantly in a regressive or anticipatory 
direction, i.e. features of one phoneme are anticipated in the articula- 
tion of the preceding phoneme; or it is progressive or perseverative, 1.e. 
one phoneme markedly influences the following phoneme; and, some- 
times, a fusion or coalescence of phonemes may take place. 


11.3.1 Allophonic Variations 


Since the realization of any phoneme differs according to the context in 
which it occurs, it is necessary to give examples only of those variants 
which exhibit striking changes of phonemic features. The influence of 
the phonetic features of a segment on the production of an adjacent 
segment is a matter of co-articulatory effects. The same types of allo- 
phonic variation, involving a change of place of articulation, voicing, 
lip position, or position of the soft palate, may be found within the 
word and also at word boundaries:— 


(1) Place of mouth articulation 

(a) within word:— 
/t/—post-alveolar in try (influence of [1]) 

dental in eighth (influence of [6])) 

/k/'—advanced (pre-velar) in key (influence of [1:]}) 
/n/—dental in tenth (influence of [8]) 
/m/ or /n/—labio-dental in nymph, infant (influence of [f]) 
/a/ —retracted in result (influence of [tp 


! Note the combinative OE [c, 1] > [tf, d3] before a front vowel, involving a consider- 
able phonetic shift in the nature of the phoneme. 
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/u:/—centralized in music (influence of [}]) 

(b) at word boundaries: — 

/t/—dental in not that (influence of [0]) 

/d/—dental in hide them (influence of [0]) 

/n/ or /m/—labio-dental ten forks, come for me (influence of [f]) 
/s/—retracted in this road (influence of [1]) 


(2) Voice—usually devoicing of continuants following a fortis 
consonant. 

(a) within word:— 

/r/—devoiced in cry; /1/—in plight; /w/—in quite 

/3/—in queue 

/m, n, n/ are slightly devoiced in such words as smoke, snow, 

mutton, open (when pronounced /‘3upm/), bacon 
(when /‘beikn/) 

(b) at word boundaries (such variations being restricted to cases 
where a sequence of words forms a close-knit entity—a phrasal word or 
rhythmic group):— 

/1/—in at last [ala:st]; /r/—in at rest [a‘trest]; /w/'—in at once 
[aMwans]; /3/—in thank you [eenkju]. 

Note also the devoicing of word final lenis fricative or plosive 
consonants before silence, and the fricatives when followed by a fortis 
consonant, and of word initial lenis fricative or plosive consonants 
when preceded by silence, e.g. in What can you give? ([v]); Can you 
breathe? ([91); It’s his ([z]); Near the bridge ([d3]); they’ve ([v]) come; 
with ([9]) some; He's ([z]) seen it; George ([d3]) can; ([v]) very good 
([d]); (19D) there; ([z]) zinc does ([z]). 


(3) Lip position—under the influence of adjacent vowels or semi- 
vowels. (a) within word:— 


lip-spread lip-rounded? 


/p/ pea, heap pool, hoop, upward 

/t/ tea, beat two, boot, twice, outward 
/k/ keep, speak cool, spook, quite, backward 
/m/ mean, seem moon,loom, somewhat 

/n/ knee, seen noon, onward 

// leave, feel loom, fool, always 

/1/* read rude 

/1/ feel, leaf fool, roof 


| Note also the devoicing /w/ < /u/ in to followed by a vowel, e.g. see to it [si : tw 1t]. 

¿This will apply only for those speakers who have appreciable rounding of the vowel. 

3 For some speakers /r/ has inherent labialization and will not be lip-spread even before 
a lip-spread vowel. 
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/s/ seat, geese Soon, goose, Sweep 
/{/ sheet, leash shoot, douche, dishwasher 
/h/ he who etc. 


(b) at word boundaries (any marked labialization being dependent 
upon the close-knit nature of the group): e.g. /t, k, n, p, l, s/ are some- 
what labialized in such cases as that one, thick one, thin one, wrong 
one, Shall we, this way, the syllables with initial /w/ carrying no accent; 
a rounded vowel in an adjacent word does not seem to exert the same 
labializing influence, e.g. /u:/ does not labialize /s/ markedly in Who 
said that? nor /2:/ in This ought to. 


(4) Nasal resonance—resulting particularly from anticipatory but 
also from prolonged lowering of the soft palate in the vicinity of a nasal 
consonant. 

(a) within word: nasalization of vowel preceding /m/ in ham and 
/n/ in and, of vowel between nasal consonants in man, men, inner- 
most, and of short vowels on each side of the nasal consonant in any, 
sunny, summer, singer; also /l/ in such situations as in helmet, 
wrongly; and possible slight nasalization of vowel following /n/, as in 
now. 

(b) at word boundaries: vowels may sometimes be nasalized some- 
what by the boundary nasal consonant of an adjacent word, especially 
when an adjacent nasal consonant also occurs in the word containing 
the vowel, e.g. the first /9/ in bring another, or /1/ in come in, and also, 
without an adjacent nasal consonant in the word containing the vowel 
(usually unaccented), e.g. /3/ in come along, wait for me, /1/ in every 
night, etc.; or again, a frictionless continuant consonant may be 
nasalized, e.g. /l/ in tell me. 


11.3.2 Phonemic Variations 


In a synchronic view of the present state of the language, in which a 
word’s internal phonemic pattern is established and largely invariable, 
there can be no question of assimilation involving phonemic change (as 
against allophonic variation). Nevertheless, different pronunciations 
of the same word (either between two speakers or between different 
styles of speech in the same speaker) sometimes exhibit a different 
choice of internal phoneme depending on the degree of assimilatory 
pressure of the word environment felt by the speaker, e.g. length may 
be /len8, lenk8/, or /len6/, encounter may have /in/ or /19/ in the first 
syllable, diserace may have final /s/ or /z/ in the first syllable, 
absolutely may have final /b/ or /p/ in the first syllable, and issue may 
have medial /sj/ or a coalesced form /f/. From a diachronic point of 
view, a phonemic change within a word may sometimes be attributable 
to the combinatory pressures exerted on a phoneme by the word 
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environment, e.g. by labialization /wa(:)/ > /wb/ or /wo:/ (swan, 
water); /1r, er, ur/ coalesced into /ar/ (later /3: /) under the influence of 
the post-vocalic /1/ (first, earth, curse); /s/ or /z/ and /j/ combined 
their phonetic characteristics to give /f, 3/ (mansion, vision), etc. 

But, in the contemporary language, it is at word boundaries in con- 
nected speech that most cases of phonemic change occur (1.e. change as 
compared with the phonemic pattern of the isolate word form). Such 
phonemic variation is found in changes within the pairs of fortis/lenis 
phonemes and, more particularly, in changes involving modification of 
the place of articulation, or a combination of voicing and place. 


11.3.3 Fortis/Lenis Variations 


Word final lenis fricatives followed by a word initial fortis consonant 
may with some speakers be realized as the corresponding fortis 
fricative, if the two words form part of a close-knit group. Thus, isolate 
final /0/ of with may be replaced by /8/ in with thanks; /z/ of was by 
/s/ in he was sent; /v/ of of, we’ve, by /f/ in of course, we’ve found it. 
Such a change to a fortis fricative is an extension of the allophonic 
devoicing of such consonants mentioned in §11.3.1(2). The phonemic 
change may be complete in that a preceding long vowel or diphthong 
may be realized in the reduced form appropriate to asyllable closed bya 
fortis consonant, e.g. in breathe, these, we’ve, chose, in the examples 
given above, but this is relatively rare. 

The weak form of is or has is /s/ or/z/ according to the final con- 
sonant of the preceding word, e.g. the cat has or is with /s/ (= the 
plural or possessive morpheme in cats, cat’s), cf. the dog has or is with 
/z/ (= the plural or possessive morpheme in dogs, dog’s). 

It is unusual in RP for word final /b, d, g/ to be influenced in the 
same way by following fortis consonants, though voiceless forms may 
be heard in such contexts in northern speech, e.g. the /d/ of good time 
and the /g/ of big case may be realized as /t, k/. 

It is to be noted that word or morpheme final fortis consonants in 
English rarely show tendencies to assimilate to their lenis counterparts: 
such pronunciations of nice boy, black dress, half-done, they both do, 
wishbone, birthday, as /‘naiz \bo1, 'bleeg \dres, 'ha:vidan, del baud 
\du:, \wi3,baun, \b3:0,dei/ are typical of many foreign learners. 


11.3.4 Nasality and labialization 


Phonemic assimilations involving nasality (i.e. anticipation or pro- 
longation of the lowered soft palate position) would be likely to show 
/b/ (or /v/) > /m/, /d/ (or /z/ or /0/) > /n/, /g/ > /n/, such 
changes being based on roughly homorganic mouth articulations; 
nasalization of other sounds, e.g. /1/ or vowels, is never phonemic, 
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there being no nasalized counterpart with approximately homorganic 
mouth articulation. Such phonemic nasalization as does occur concerns 
mainly the alveolars, especially adjacent to the negative "nt /n(t)/, and 
is characteristic of very rapid speech, often as a popular form unaccept- 
able in RP (marked * in the examples below), e.g.:— 


/d > n/—He wouldn't do it /hi wonn(t) du: 1t/ 
good news /'gun mju: z/ 

/d > g > n/—He wouldn't go /hi 'wunn(k) \gou/ 

/d > b > m/—Good morning /,gum mo:nin/ 

/v > m/— You can have mine /ju kn hem ‘main/ 


(In the above examples, the nasalized assimilated form may be elided 
together.) 


/z > n/—He doesn’t know /hi dann(t) mou/* 
/0 > n/—He wasn’t there /hi ‘wonn(t) neo/* 
to win the race /ta ‘win na wre1s/* 


The extension of /abialization produces no changes of a phonemic 
kind, since lip-position is not a distinctive feature opposing any two 
phonemes in RP. /D/ and /a:/ come nearest to having an opposition of 
lip action, but the lip-rounding for /D/ is very slight and open and, in 
any case, there is some difference of tongue position and considerable 
difference of length. Where /w/ precedes a vowel of the /a:/ type (and, 
therefore, might be expected to exert a rounding influence), either 
labialization has become established at an earlier stage of the 
language’s development (e.g. in was, what, war, water, etc.) or two pro- 
nunciations are to-day permitted, e.g. qualm /kwa:m/ or /kwd:m/, 
quaff /kwof/ or /kwa:f/. Labialization of /a:/ involving a phonemic 
change to /p/ or /9:/ does not extend beyond word boundaries, e.g. in 
two arms or the car won’t go. Some confusion may, however, occur 
between a strongly centralized form of /au/ and /3:/ ina labial context, 
cf. they weren't wanted and they won’t want it; also, with the influence 
of a strongly labialized form of /r/ in such a pair as they weren’t right, 
they won't write. 


11.3.5 Variations of Place: Assimilation 


The assumption by a word-final phoneme (in the isolate form) of the 
phonetic characteristics of a following phoneme in connected speech is 
a frequent result of assimilatory tendencies, usually regressive (anti- 
cipatory) or coalescent, involving a variation in the place of articula- 
tion. Though such changes are normal in colloquial speech, native 
speakers are usually unaware that they are made. The phenomenon is 
essentially the same as that resulting in non-phonemic assimilation of 
place. (See §§8.2.5(2), 8.6.2(2).) 
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Regressive (or Anticipatory) Assimilation: Instablity of final 
alveolars! 


Word final /t, d, n, s, z/ readily assimilate to the place of the following 
word initial consonant, i.e. whilst retaining the original voicing, /t, d, 
n/ are replaced by bilabials before bilabial consonants and by velars 
before velar consonants; /s, z/ are replaced by palato-alveolars before 
consonants containing a palatal feature: 


/t/ + /p/ before /p, b, m/, e.g. that pen, that boy, that man / deep 
pen, ‘Ozep bal, ‘deep men/; 
— /k/ before /k, g/, e.g. that cup, that girl /deek kap, ðæk 
aa 1 
but /p/ does not show similar changes before /t, d, n, k, 
g/ 
nor /k/ before /t, d, n, p, b, m/. 
/d/— /b/ before /p, b, m/, e.g. good pen, good boy, good man 
/‘gub pen, 'gub bai, gub \mzen/; 
— /g/ before /k, g/, e.g. good concert, good girl /'gug \kpnsat, 
'gug\g3:1/; 
but /b/ does not show similar changes before /t, d, n, k, 
g/, 
nor /g/ before /t, d, n, p, b, m/. 
/n/—/m/ before /p, b, m/, e.g. ten players, ten boys, ten men 
/'tem plez, 'tem \boiz, tem \men/ 
— /ņ/ before /k, g/, e.g. ten cups, ten giris /'ten „kaps, ‘ten 
\g3:1Z/. 
(As a result of word final assimilations, /n/ may be 
preceded by vowels other than /1, e, æ, b, A/. Thus /n/ 
can occur after long vowels as a result of assimilation (cf. 
§9.8), e.g. I’ve been /bizm/ gardening, She’ll soon 
/'suzn/ come, his own /'3UN/ car, etc.) 
Assimilations of /m/ to /n/ before /t, d, n/ and to /n/ 
before /k, g/ or of /n/ to /m/ before /p, b,m/andto /n/ 
before /t, d, n/ are abnormal or occur in speech charac- 
terized generally as slipshod. 
/s/ — /5/ before /f, tf, d3, J/, e.g. this shop, cross channel, this 
judge, this year /'ð1f fop, ‘krof ‘tfzen], ‘dif \d3ad3, ‘dif 
Ney 
However, /s/ shows no phonemic change before other 
consonants, e.g. /'018 \O1n/ (this thing) is abnormal; /f, 
$/, too, are not influenced to any marked extent by a 


'See also A. C. Gimson, ‘The instability of English alveolar articulations,’ Le Maître 
Phonétique, no. 113, 1960. 
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following consonant, but, in very rapid and informal 
speech, final /6/ may assimilate to /s/ before /s/, e.g. 
both sides /'baus \saidz/. 

/z/— /3/ before /J, tf, d3, J/ or sometimes — /f/ (change to fortis) 

before /f/, e.g. those young men /'ðəu3 jay \men/, 
cheese shop /“Si:3 fop/, those churches /'dauz \tf3:tfiz/, 
has she? /‘hee3 fi/ or /heef fi/. 
However, /z/ does not show similar changes before other 
consonants. The place of articulation of /v, 3/ is not 
usually influenced greatly by a following consonant but, 
in very rapid and informal speech, final /0/ may 
assimilate to /z/ or /s/ before /s, z/, e.g. I loathe singing 
/a1 louz \sinin/, and word initial /O/—especially in the 
article the—may assimilate to /s, z/ following /s, Z/, e.g. 
What's the time? /'wots za tam/, Has the post come? 
/‘heez zə ¡paus ¡kam/, in very rapid speech. 


In this connection, it is to be noted that the alveolars have a relatively 
high frequency of word final occurrence, especially when inflexional, 
and are particularly apt to undergo neutralization as redundant 
oppositions in connected speech. As always, phonemic oppositions 
having been neutralized, the sense of an utterance may be determined 
by the context, e.g. / ren \kwikli/ (ran or rang quickly), /‘raip ¡peoz/ 
(right or ripe pears or pairs), /laik \kri m/ (like or light cream), /‘hop 
mənjuə/ (hot or hop manure), /'peerif ‘fau/ (Paris Show or parish 
show), /‘wotf jo: \weit/ (what's or watch your weight),' or, with a 
neutralization to a labio-dental articulation, /‘greip ,vain/ (great or 
grape vine), [ram fa jo: mani] (run or rum for your money). 

When alveolar consonants /t, d, n/ are adjacent in clusters or 
sequences susceptible to assimilation, all (or none) of them will undergo 
the assimilation, e.g. Don’t /'daump/ be late, He won’t /'wəunk/ 
come, I didn’t /‘dignk/ go, He found /'faumb/ /both, a kind /‘kaiyg/ 
gift, red and black /'reb m blek/. 


(2) Coalescence of /t, d, s, z/ with /j/ 

The process which has led to earlier /t, d,s, z/ + /j/ giving /tf, d3, J, 3/ 
medially in a word (nature, grandeur, mission, vision—§8.3.1) may 
operate in contemporary colloquial speech at word boundaries, e.g.:— 


/t/ + /j/—What you want... /wotfu wont... ./ 

/d/ + /3/— Would you? /'wudzu/ 

/s/ + /j/—In case you need it /19 'keifu mi: d 1t/ 

/z/ + /1/—Has your letter come? /heezo: ,leta kam/ 
as yet /3:3et/ 


! If the utterance is reduced to /wotfo: weit/, there is also the possibility of the inter- 
pretation watch or wait. 


300 An Introduction to the Pronunciation of English 


The coalescence is more complete in the case of /t, d,/ + /j/ (espe- 
cially in question tags, e.g. didn’t you?, could you?, etc.); in the case of 
/s, z/ + /j/, the coalescence into /f, 3/ may be marked by extra length 
of friction, e.g. Don’t miss your train /'daump 'miffo: \trein/, cf. J 
can’t be sure /ai 'ka:mp b1\fo:/. 

In very careful speech, some RP speakers would use somewhat 
artificial, uncoalesced, forms within words, e.g. /‘neitjo, ‘kwestjon, 
An‘fo:tjunat, ‘sauldja/ (nature, question, unfortunate, soldier), etc. 
Such speakers would also avoid coalescences at word boundaries; 
other, careful, speakers who use the normal coalesced forms within 
words, would consciously avoid them at word boundaries. But see 
§11.6. 


(3) Progressive (or Perseverative) Assimilation is relatively uncom- 
mon. It may occur when a plosive is followed by a syllabic nasal and 
the nasal undergoes assimilation to the same place of articulation as 
the preceding plosive, e.g. /n/ — /m/ after /p, b/, happen, urban 
/hepm, 3: bm/; and /n/— /1/ after /k, g/ in second chance, organ 
/'seky \tfa: AR 


11.3.6 Elision 


Apart from word internal elisions (see $9.6) and those associated with 
weak forms, sounds may be elided in rapid, colloquial speech, espe- 
cially at or in the vicinity of word boundaries. 


(1) Vowels (a) Allophonic variation.—As in the case of consonants, 
variations in the production of vowels may be insufficient to cause a 
change of phoneme. When one syllable ends with a closing diphthong 
(i.e. one whose second element is closer than its first, in RP el, al, 91, 
34, au/) and the next syllable begins with a vowel, the second element of 
the diphthong may be elided. Word internal examples of the type 
discussed in §8.2.5 (e.g. hyaena /hari:na/ smoothed to [ha'izna]) may 
result in neutralization, thus layer /‘leia/ with smoothing = lair [lea], 
mower /'maus/ with smoothing = myrrh [moa] interpretable as /m3: /. 
Similar smoothing occurs across word boundaries, e.g. go away [,g3: 
\wel], [may as well [ai meəz ,wel], J enjoy it [ar in‘d3p 1t], try again [ tra 
a‘gen] or [,tra: *gen]. 

(b) Phonemic elision.—Initial /9/ is often elided particularly when 
followed by a continuant and preceded by a word final consonant (com- 
pensation for the loss of /a/ frequently being made by the syllabicity of 
the continuant), e.g. not alone / notl Maun/, get another /' getn \nada/, 
run along / ran] ‘lon/, he was annoyed /hı wəzn ‘noid/; or again, when 
final /9/ occurs with following linking /r/ (see 8] 1.3.7) and word initial 
vowel, /9/ may beelided, e.g. after a while /'a ftra\wall/, as a matter of 
fact /oz ə mætrəv ¡feekt/, father and son /'fa Oran \san/, over and 
above /'3uvran ə bav/. 
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(2) Consonants. —In addition to the loss of /h/ in pronominal weak 
forms and other consonantal elisions typical of weak forms,' the 
alveolar plosives are apt to be elided. Such elision appears to take place 
most readily when /t/ or /d/ is the middle one of three consonants and 
has the same voicing? as the first consonant of the sequence. Any con- 
sonant may appear in third position, though elision of the alveolar 
plosive is relatively rare before /h/ and /j/. Thus elision is common in 
the sequence voiceless continuant + /t/ or voiced continuant + /d/ 
(e.g. /-st, -ft, -$t, -nd, -ld, -zd, -dd, -vd/) followed by a word with an 
initial consonant e.g. next day, raced back, last chance, first light, west 
region, just one; left turn, soft centres, left wheel, drift by, soft roes; 
mashed potatoes, finished now, finished late, pushed them; bend back, 
tinned meat, lend-lease, found five, send round, dined well; hold tight, 
old man, cold lunch, bold face, world religion; refused both, gazed 
past, caused losses, raised gently; loathed beer; moved back, loved 
flowers, saved runs, served sherry, etc. Similarly, word final clusters of 
voiceless plosive or affricate + /t/ or voiced plosive or affricate + /d/ 
(e.g. /-pt, -kt, -tft, -bd, -gd, -d3d/) may lose the final alveolar stop 
when the following word has an initial consonant, e.g. kept quiet, 
helped me, stopped speaking, jumped well; liked jam, thanked me, 
looked like, looked fine, picked one; reached Paris, fetched me, 
reached Rome, parched throat; robbed both, rubbed gently, grabbed 
them; lagged behind, dragged down, begged one; changed colour, 
urged them, arranged roses, judged fairly, etc. (In the sequence /-skt/, 
/k/ rather than /t/ is often elided, e.g. risked prison, asked them.) 
Elision of final /t/ or /d/ is rarer before initial /h/, e.g. the alveolar 
stops are more regularly retained in kept hold, worked hard, East Ham, 
reached home, gift horse, rushed home, grabbed hold, round here, bald 
head, jugged hare, changed horses, raised hands, moved house, etc. 
Final sequences /-nt, -1t/ tend to keep either /t/ or [?], the appropriate 
reduction of preceding sounds being retained as a result of the final 
fortis plosive, e.g. in sent back, count them, can’t fail; felt bad, built 
near, salt lake, etc. Final /t, d/ followed by a word beginning with /j/ 
are usually kept in a coalesced form with /j/, 1.e. /tf/ or /d3/, e.g. 
helped you, liked you, lost you, left you, grabbed you, lend you, told 
you, etc. It will be seen that in many cases, e.g. in J walked back, they 
seemed glad, elision of word final /t/ or /d/ eliminates the phonetic cue 
of past tense, compensation for which is made by the general context. 
Such is the instability of the alveolar plosives in such a position of 
apparent inflexional significance that it can be assumed that the context 


See also §8.2.5. | 
2The negative contraction /-nt/ is an exception to this voicing rule. (See later in this 


paragraph). 
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regularly carries the burden of tense distinction. Where the juxta- 
position of words brings together a cluster of consonants (particularly 
of stops), elision of a plosive medial in three or more Is to be expected, 
since, because of the normal lack of release of a stop in such a situation, 
the only cue to its presence is likely to be the total duration of closure. 

The /t/ of the negative /-nt/ is often elided, particularly in disyl- 
lables, before a following consonant, e.g. you mustn’t lose it /jU ‘masn 
“luzz 1t/, doesn’t she know? /'dazn fi ,nau/, and sometimes before a 
vowel, e.g. wouldn’t he come? /'wudn 1 ,kam/, you mustn’t over-eat 
/ju'masn puvər i:t/. Less common is the omission of the stops in the 
negative /-nt/ component of monosyllables, e.g. he won’t do it /h1 
'woun du: 1t/; in this latter case, there isa tendency for /au/ to have full 
rather than reduced length because of the omission of the syllable-final 
fortis stop. Clusters of word final /t/ and word initial /t/ or /d/ are 
sometimes simplified in rapid informal speech, e.g. /’ve got to go /a1v 
¡gota *g2u/, what do you want? /'woda ju \wont/, and less commonly 
/d/ before /t/ or /d/, e.g. we could try /w1ka trai/, they should do it 
/Mel fa du: 1t/. The elision of one of a boundary cluster of two con- 
sonants sometimes occurs in very rapid speech, but is usually charac- 
terized as a vulgarism, e.g. he went away /hi ‘wen 9,we1/, I want to come 
/ai wona ¡kam/ (< /al ‘wonta ¡kam/, which frequently occurs), give 
me a cake 'gi mia \keik/, let me come in 'lemi ‚kam un/, get me some 
paper /'gemı sm \peip2/, as well as the most reduced forms of I'm going 
fo /aim gana, ainana, ainna/. 


11.3.7 Liaison 


As has been mentioned in $8.8(2)(a), RP retains word final post-vocalic 
/r/ as a linking form when the following word begins with a vowel, i.e. 
in those cases where an [r] sound existed in earlier forms of RP, as the 
spelling indicates. The vowel endings to which an /r/ link may, in this 
sense, justifiably be added are /-a:, -9:/ and those single or complex 
vowels containing final [a] (/a, 3:, 19, es, va/), e.g. in far off, four aces, 
answer it, fur inside, near it, wear out, secure everything. By analogy, 
this /r/ linking usage is extended to all /a:, 9:, 9/ endings, even when 
there is no historical (spelling) justification. Such intrusive /r/s are to 
be heard particularly i in the case of /a/ endings, egi Russia and China 
/ rafar on tfamo/, drama and music /'dra:mər ən 'mju:zık/, idea of 
/aidiar av/, India and Pakistan /'mdiər ən ¡pa:ki'sta: n/, area of agree- 
m a er ə'gri:mənt/, etc. Less frequently, analogous links 
(unjustified by the spelling) are made with final /a:, 3t/ e.g. Shah of 
Iran /'fazr əv rra:n/, law and order /'l>:r and ‘9: də/, awe-inspiring 
/'3: r n,spatarin/. It is clear that the RP system of linking /r/s strongly 
encourages the creation of analogous links in similar phonetic contexts. 
Spelling consciousness remains an inhibiting factor, but the present 
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general tendency among RP speakers is to use intrusive /r/ links after 
final /a/, even—unconsciously—among those who object most 
strongly. Words containing final /a:/ or /o:/, without an earlier form 
with /r/, are less common than those with /2/. The pressure of analog y 
to use an intrusive /r/ might in consequence appear to be greater; but 
the comparative rarity of the cases of such intrusive /r/s tends to make 
speakers more aware of the ‘correct’ forms; thus / saw it /al 'sozr 1t/, 
drawing, /'dro:rim/, are generally disapproved of, though it is likely 
that many RP speakers have to make a conscious effort to avoid the use 
of such forms. The focusing of attention on this particular type of 
analogous formation as an undesirable speech habit has led to the use 
by some speakers of a pause or glottal stop in such critical cases of 
vowel hiatus, with the result that, in avoiding ‘intrusive’ /r/s, they have 
also abandoned justifiable linking /r/s in favour of a vowel glide or 
glottal stop, e.g. in secure it, I’m sure it does, War and Peace, winter 
evening. If such a situation had arisen in a freer stage of the develop- 
ment of the language, 1.e. when spelling forms were not so fixed or 
influential as at present, it is possible that the intrusive /r/s would have 
been adopted as regular usage and the spelling modified. As might be 
expected, in those regions where post-vocalic /r/ is pronounced and 
pour, paw identified as separate word forms in isolation, the tendency 
to introduce analogous linking /r/s is less marked than in RP orin RP- 
influenced types of speech. 

It should be noted that, in synchronic terms, the same process is in 
Operation whether the /r/ link inserted is historically justified (linking) 
or not (intrusive). The examples below demonstrate that the environ- 
ment is phonetically comparable whether the /r/ link is inserted before 
a suffix or before a separate word and whether it is linking or intrusive. 


Stir stirring stir it in 


/st3: ‘st3irm Sart n 

dear dearer my dear Anna idea of it 

/dio  'diora mai dior ‘genoa al dior av 1t/ 

roar roaring roar angrily raw egg strawy 
Vio forrin 'rorr ‘eengrali Toir eg ‘Stra ri/ 
star starry a star in the sky the spa at Bath Schwaish 
/sta: 'sta:ri ə 'stazr in 99 'skar da ‘spatr ət ‘ba:@ 'fwairif/ 


There appears, however, to be some gradation in the likelihood of 
occurrence as follows: 
(1) Where a word ends in a non-high vowel, the insertion of /r/ is 
obligatory before a suffix beginning with a vowel. (See §9.10.) 
(2) Before another word the insertion of such an /r/ is optional. 
However, it occurs in the vast majority of cases where a historically 
justified linking /r/ is possible. 
(3) Where the /r/ link is intrusive, speakers tend to use it after /9/ (e.g. 
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vanilla essence /va nilar 'esans/, vodka and tonic /'vogkar an tonik/) 
more readily than 

(4) after /a:/ or /o:/ (e.g. nougat and chocolate 'nu:ga:r an tfoklit/, 
straw in the wind /'stro:r in 6a ‘wind/). 


There is considerable resistance to 


(5) the insertion of intrusive /r/ before a suffix (e.g. strawy /'stro:r1/. 

Such avoidance occasionally leads to the omission of the normal 
(citation form) /r/ in a word such as assurance, giving rise to a from 
form /a forans/ which may be regarded as hypercorrection. When the 
fear, with some speakers, of using the intrusive /r/ inhibits liason 
across word boundaries, a vowel glide or a glottal stop may be used, e.g. 
the door opened /6a 'da:waupand/ or [ðə da: \2aupand]. 

Finally, it may be added that it is usual (and not stylistically signi- 
ficant) in connected speech for the linking /r/ form of words to be used 
before a vowel, e.g. thanks for everything /far/, my father and mother 
/'fa:dar/, the weather ought to improve /wedar/, here and there 
/hior/, I don’t care if they do /kear/. 

Phonetically (as well as historically) the resulting /r/ closes the 
syllable rather than being initial in the next, e.g. the /r/ of more ice 
/'mo:r ‘ais/ is shorter than that of more rice /'ma: 'ra1s/, the latter also 
being associated with stress onset and possible pitch change. (cf. 
§11.3.8.) 

It is unusual for a word final consonant to be carried over as initial in 
a word beginning with an accented vowel, the identity of the words 
being retained (see §11.3.8). Thus, run off, give in, less often, are rarely 
/ ra ‘nof, givin, ,le‘sofn/; more particularly, the fortis plosives do not 
acquire aspiration such as would accompany their shift to an accented 
syllable-initial position, e.g. get up, look out, stop arguing, are not 
usually [ige “Pap, lu ‘k'aut, ¡sto “pta: gjuinj. One or two phrases in 
common use do, however, show such transference, e.g. athome, not at 
all, often pronounced [a ‘t"aum, not a ‘t"s:t]; they may be considered as 
constituting, in effect, composite word forms. 


11.3.8 Juncture 


Despite the fact that the word may have its isolate-form identity con- 
siderably modified by its immediate phonemic and accentual context, 
both as regards its constituent sounds and its accentual or rhythmic 
pattern, phonetic features may be retained in the speech continuum 
which mark word or morpheme boundaries. Thus, the phonemic 
sequence /p1:sto:ks/ (with secondary accent on the syllable containing 
/d:/) may mean pea stalks or peace talks according to the situation of 
the word boundaries (i.e. /1:-st/ or /1:st-t9:/). In this case, if the 
boundary occurs between /s/ and /t/, the identity of the words peace 
and talks may be established by the reduced /1:/ (in a syllable closed by 
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a fortis consonant) and by the slight aspiration of /t/ (initial in a 
syllable carrying a secondary accent); on the other hand, if the 
boundary occurs between /i:/ and /s/, this may be signalled by the 
relatively full length of /i:/ (in an open word-final syllable) and by the 
unaspirated allophone of /t/ (following /s/ in the same syllable), as 
well as by the stronger /s/. Such phonetic differentiation depends upon 
the speaker’s consciousness of the word as an independent entity. Or, 
alternatively, distinctive function of phonetic features in the continuum 
may be said to be associated with the type of juncture (close or open) 
which occurs; thus, in pea stalks, open (i.e. as before a pause) juncture 
relates /1:/ to /s/ and close (i.e. as within a word) juncture relates /s/ to 
/t/, whereas in peace talks, close juncture relates /1:/ to /s/ and open 
juncture /s/ to /t/, with the relevant phonetic cues. If the two 
utterances were not distinguished in such terms, it would be necessary 
to postulate, for instance, a phonemic opposition between the full and 
reduced forms of /1:/ and between the aspirated and unaspirated types 
of /t/. A phonemic transcription which follows the orthographic con- 
vention in showing spaces or hyphens. between words retains the 
identity of the word but obscures the continuous, unbroken nature of 
connected speech. The following examples illustrate various ways in 
which phonetic cues may mark word boundaries 


/aneim/—a name (relatively long /n/, associated with stress onset 
and possible pitch change) 
an aim (relatively short /n/, stress and pitch change 
beginning on /e1/) 
/Oetstaf/'—that stuff (unaspirated /t/, strong /s/) 
that’s tough (aspirated /t/, weaker /s/) 
/daweitakatit/'—the waiter cut it (reduced /e1/, rhythmic groups 
/da'weita- katit/) 
the way to cut it (long /ei/, rhythmic groups 
/da wel-ta katit/) 
/aiskrizm/—IJ scream (long /ai/, strong /s/, relatively little 
devoicing of /r/) 
ice-cream (reduced /al/, weaker /s/, devoiced /r/) 
/haustreind/—how strained (long /au/, strong /s/, little devoicing 
of /r/) 
house trained (reduced /au/, weaker /s/, devoiced 
/T/) 
/waitfu:z/—white shoes (reduced /al/, long /f/) 
why choose (long /ai/, short [f] element of /tf/ 
affricate) 


! Examples from Daniel Jones, ‘The word as a phonetic entity,’ Le Maitre Phonétique, 
no. 36, 1931, and ‘The hyphen as a phonetic sign’, Zeitschrift für Phonetik, 1956. 
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Similarly, simple word entities may be distinguished from words 
composed of separable morphemes:— 


/hainis/'—highness (/ai/ and /n/ in close juncture, rhythmical 
shortening of /a1/) 
high-ness (/ai/ and /n/ in open juncture, full length of 
/a1/) 
/naitreit/*—night-rate (/t/ and /r/ in open junctural relationship, 
little devoicing of /r/) 
nitrate (/t/ and /r/ close junctured, devoiced /r/) 


It is to be noted, however, that such junctural cues are only poten- 
tially distinctive and, in any case, merely provide cues to word identi- 
fication additional to the large number already contained in any 
utterance. Junctural oppositions are, in fact, frequently neutralized in 
connected speech or may have such slight phonetic value as to be 
difficult for a listener to perceive; they may, therefore, besaidtobeona 
lower plane of relevance than the phonemic units conveyed by qualities 
and the various accentual patterns. 


11.4 Frequency of Occurrence of Monosyllabic and Polysyllabic 
Words 


In a running text of a conversational kind, the following approximate 
percentages of occurrence of words containing different numbers of 
syllables are to be expected: 1 syllable—81%%; 2 syllables—15%; 3 
syllables—3%. The remaining 1% of words Have 4 syllables or more, 
those with 5 or more syllables accounting for a minute proportion of the 
total word list. If the 1,000 most common words used are examined,? 
it has been calculated that some 15% admit of the kind of phonemic 
variability mentioned in §9.11. Half of such words permitting 
phonemic variation are monosyllables whose phonemic structure 
depends upon the degree of accent placed upon them. 


11.5 Advice to Foreign Learners 


The foreign learner need not attempt to reproduce in his speech all the 
special context forms of words mentioned in the foregoing sections. He 


! From Daniel Jones, ‘The hyphen. . .,’ op. cit. 

¿From Trager and Bloch, ‘The syllabic phonemes of English,’ Language, 17, 1941; 
also Trager and Smith, An Outline of English Structure, 1951; 1. Lehiste, ‘An acoustic- 
phonetic study of internal open juncture’, University of Michigan, 1959. 

3See my ‘Note on the variability of the phonemic components of English words,’ 
(1969), op. cit. 
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must, however, observe the rules concerning weak forms, should 
cultivate the correct variations of word rhythmic patterns according to 
the context, and should make a proper use of liaison forms (German 
speakers, in particular, should avoid an excess of pre-vocalic glottal 
stops). In addition, he should be aware of the English assimilatory 
tendencies governing words in context, so as to avoid un-English 
assimilations such as / like that /ai ‘laig ,Ozt/ (incorrect voicing) or / 
was there /al wa0 MWea/ (incorrect dental modification of the place of 
articulation). If his speech is to be a perfect reproduction of that of the 
native speaker, he must use the special assimilated and elided word 
forms used in context which have been described above. In any case, 
whether or not he uses such forms himself, he must know of their 
existence, for otherwise he will find it difficult to understand much of 
ordinary colloquial English. This knowledge is particularly important 
because a second language is often learned on a basis of isolate word 
forms; in the speech of the native, however, the outline of these words 
will frequently be modified or obscured, as has been seen. 

The foreign learner is recommended to aim at a careful colloquial 
style of English in his own speech and, at the same time, to be aware of 
the features which characterize the rapid colloquial (or familiar) style 
he is likely to hear from native speakers. The following dialogue 
illustrates some of the differences which may be found between the two 
styles, (1) being careful colloquial and (2) rapid familiar: — 


A. What do you think we should do this evening? 
(1) ‘wot du ju  '0mk wi(:) fud du: ðs ¡¡:vnin 


(2) 'wod3u ‘eink wi fad du: ds ivn 
B. How many of us will there be? 

(1) 'hau meni 'bv əs wil da Di 

(2) ‘hau mni ov as 1 6a bi: 


A. There are the two of us, and probably the two girls 
(1) der ə Oa ‘tu: av ,as, ond ‘probabli do tu: 'g3:lz 


(2) doar 9 da ‘tu: ov /AS, m 'probbl1 do ‘tu: ‘g3:lz 


from next door. That’ll be four of us already 
(1) from ‘nekst ido: Oetl bi “forr av əs ə:Dredı 
(2) frm meks do: ,detl bı forr av as o:‘redi 


I think they ’re a nice young couple, don’t you? 
(1) an Oink ¡en or ə Mais jay ¿Kapl, daunt Ju: 
(2) a(Qi) Oink dear 2 ‘naif fan ¿Kapl, daun,tfu: 
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B. I’ve only talked to them once, but they seemed nice 
(1) aiv unli ‘tozkt tə dam ‘’wans,bat dei ‘si:md_ ,nais 
(2) a(mv unt Mto:k ta Om  ,wans,bot ðe(1)`siim ,nais 


I wonder if we should go to the theatre 
(1) at ‘wander if wi fud gov tə oa neta 
(2) a(t) 'wandr af wi fg gov to 09  Bneta 


I can try and book some seats round the corner 
(1) at kon ‘trai ən buk səm ,si:ts raund 03 `kə:nə 
(2) afi) kn Mtra: m  buk sm ¿Stits raun Oa ‘korno 


11.6 Stylistic Variation 


It should be stressed that all the features of connected speech discussed 
in this chapter are common in the normal, fluent speech of native 
speakers of English and that the lack of such features would be 
abnormal and artificial. As throughout the rest of this book, the 
descriptive emphasis has been on RP which, it may be seen, is not a 
monolithic accent but which displays considerable variation. Many 
factors influence this variation and one which may usefully be singled 
out 1s style of discourse. 

It is important to avoid equating ‘casual’ and ‘rapid’, since slow 
speech is possible in a casual situation and rapid utterances can occur in 
more formal circumstances. It should also be borne in mind that the 
average rate of delivery differs from speaker to speaker regardless of 
style of speech. Thus the degree of formality and the rate of utterance 
are independent variables. The generalizations which follow are based 
on data drawn from twenty hours of tape-recorded spontaneous con- 
versation concentrating on the speech of six RP speakers.! Rate of 
utterance was not found to be a function of style, nor was style defined 
in terms of pace.* Rather, social situations were set up that were 
designed to be markedly formal (e.g. an undergraduate in conversation 
with a professor who was a Stranger to her) or markedly casual (e.g. a 
man in conversation with his wife). All speakers were recorded in both 
formal and casual situations and the speech of all of them exhibited the 
same trends to differing degrees. 


(1) Stress and intonation 


In all styles of speech, simple falls in pitch (whether from a high or a mid 
starting point) account for the majority of nuclear tones, between 60% 
and 70% in most conversation. The falling-rising nuclear tone accounts 


' For details of this research, see Ramsaran (1978). 
¿The slowest rate of utterance recorded in conversation was 189 sylls/min (3-1 
sylls/sec, 7-6 segs/sec) and the fastest was 324 sylls/min (5-4 sylls/sec, 13-4 segs/sec). 
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for roughly 20% on average. Thus it may be seen that speech exhibiting 
a large number of rises or rise-falls would be conspicuous in this 
respect. In general, intonation patterns show no marked distribution in 
casual speech. However, formal speech shows some concentrations of 
one pattern, whether repetitions of fall-rise nuclear tones or a stretch of 
speech with repeated rising nuclear tones, e.g. If you pull them ,off | 
and put them in a glass of ,water | they “grow little roots | and then | 
you plant them in ,soil | and they ¡grow | and then you’ve got another 
spider plant. 

Casual speech has longer intonation groups and contains fewer 
stressed syllables than formal speech. 


(2) Weak forms 


The use of strong and weak forms does not appear to be a matter of 
style except insofar as the more frequent occurrence of strong forms in 
more formal situations results from the additional stresses of formal 
rhythms. The alternation of strong and weak forms is entirely regular in 
both formal and casual styles of speech: weak forms occur unless the 
grammatical word is stressed. Since stresses are more frequent in the 
intonation groups of formal speech, strong forms may occur more 
often. 


(3) Linking /r/ 


As with weak forms, linking /r/ is frequent in all styles of speech, 
though an /r/ link is not necessarily used on every occasion where such 
an insertion would be possible. (See $11.3.7.) Its occurrence is of no 
stylistic significance. (The avoidance of intrusive /r/ results from a 
deliberate carefulness shown by some speakers in more formal speech.) 


(4) Assimilation 


Assimilations occur in all styles of speech. However, unassimilated 
forms occur more often than assimilated forms which tend to increase 
in frequency in the more casual style of speech, regardless of pace. 

The fact that rate of utterance has no direct effect on the use of 
assimilation may be illustrated by examples taken from the conversa- 
tion of a single speaker who has /d3af fatin/ for just shutting (also 
exhibiting elision of /t/) when speaking at a medium pace in acompara- 
tively formal situation, but /ho:s fau/ for horse show when speaking 
very rapidly in a casual situation. 

In general, although all types of anticipatory de-alveolar assimila- 
tion do occur, speakers use palato-alveolar assimilations (of the kind 
/spelf Jatl/ for space shuttle) and bilabial assimilations (of the kind 
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/ðæp p3:sn/ for that person) far less commonly than they use velar 
assimilations (of the kind / fork kat/ for short cut). Such velar assimila- 
tion is also more common than coalescent assimilations (such as /d/ + 
/j/ — /d3/ asin /nautid3otsman/ for noted yachtsman or /Z/ + /]/—= 
/3/ as in /bika3u: / for because you must). However, coalescence is 
frequent in common phrases such as the auxiliary verbs + pronouns of 
did you, can’t you etc. /didzu:, ka:ntfu:/ and occur even in very 
formal conversation, e.g. Would you like a cup of tea? /'wudzu: ‘laik ə 
'kap av yti:/. 


(5) Elisions 


Elisions do show some correlation with pace. In all styles they become 
more frequent as the rate of utterance increases; but, whereas in formal 
speech they are almost entirely regular (e.g. alveolar plosives may be 
elided interconsonantally, /a/ in pre-nuclear unstressed syllables, and 
/h/ in unstressed non-initial grammatical words), in casual speech they 
are less rule-bound. Casual speech may contain unpredictable elisions 
such as those of /I/ and /0/ in Well, that’s all right /we ets 9 : ralt/. 


(6) Co-occurrence of phonemic features of connected speech 


It should be noted that the occurrence of /r/ links, elisions and 
assimilations is optional in the sense that when the appropriate phonetic 
environments occur, these processes may Or may not operate. The pre- 
ceding sections (1)—(5) indicate some tendencies in the likelihood of 
occurrence. If, then, such processes do operate, they will follow the 
regular patterns described in §§11.3.5—7. Disregarding the occasional 
irregular elisions that may occur in casual speech, it should be added 
that an utterance not uncommonly contains instances of both assimila- 
tion and elision in conjunction with each other, since alveolar conso- 
nant clusters are not infrequent in word-final position: after the elision 
of a final /t/ or /d/ the remaining fricative or nasal may be assimilated 
to the place of articulation of the initial consonant of the following 
word, e.g. closed shop /klauzd fop — klauz fop — klau3 fop/, hand 
made /heend meid —heen meid — heem meid/ and just shutting /d3ast 
Satin — d3as fatin —d3Af fatin/. The conjunction and has a common 
weak form with /d/ elided and the /9/ may also be elided, leaving the 
nasal to function syllabically, particularly after plosives. The resulting 
syllabic /n/ is itself susceptible to assimilation, which accounts for such 
pronunciations as /'wug y “gla:s/ (wood and glass). 


(7) Plosive release 


One of the most stylistically significant variables concerns a non- 
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phonemic variation in the release of plosives, particularly the fortis 
series. Not only is this a matter of allophonic realization, but it also cuts 
across phonemic boundaries in that it applies to /p, t, k/ and, toa lesser 
extent, /b, d, g/. As is explained in §8.2.3(2), a plosive usually has an 
inaudible release when followed by another stop consonant. However, 
in the most formal social situations, there is a marked increase in the 
number of audibly released plosives in such a context, e.g. [ai lukt" 
\kwizikal]. 


Summary 


No feature is unique to a variety, but some features are distributionally 
marked. In particular, pre-consonantal plosives may be audibly 
released when speech is formal, whereas assimilations increase in 
frequency as speech becomes more casual. Elision is the only feature 
that bears a definite relation to pace, occurring more frequently as pace 
increases, however formal the situation may be. There is more fluctua- 
tion of pace in casual speech. 

Since assimilations, elisions and /r/ links occur with some frequency 
in all situations whether formal or casual, it is clearly important that 
foreign learners should cultivate these features in their speech. 


Appendix: 12 


Teaching the Pronunciation of 
English 


12.1 The Place of Pronunciation 


In the preceding chapters, the chief features of English pronunciation 
(especially of RP) have been set out in some detail. Clearly a foreign 
learner who requires an adequate performance in the language for the 
practical purposes of everyday communication will not need to master 
all the variants described. Nevertheless, any teacher or learner must 
consider how much of the time given to the acquisition of another 
language should be devoted to pronunciation and what level of perfor- 
mance is necessary for efficient communication. 

12.1.1 It can be claimed that speech, like writing, constitutes no 
more than the transmission phase of language, providing a signalling 
system for the language’s more essential store of items defined in the 
lexicon and of syntactic rules contained in.the grammar. Yet high 
adequacy in lexis and grammar can be negated by incompetence in the 
signalling phase, when the prime medium is speech. Thus, unless a 
learner expects to deal with English only in its written form, there is no 
escape from the acquisition of at least the rudimentary elements of 
English pronunciation. Such a conclusion implies that in any course of 
English a realistic amount of time should be devoted to practice in the 
spoken language. 

12.1.2 However, the teaching of pronunciation presents particular 
difficulties. Grammatical structures can be ordered and taught in 
sequence; a vocabulary compiled ona basis of frequency of occurrence 
can be utilized for the presentation of early grammatical structures, 
with the addition of special sets of lexical items as situations or special 
purposes require. Pronunciation, on the other hand, does not permit 
such progressive treatment, since all phonetic/phonological features 
are potentially present from the very first lesson, unless vocabulary 
items are artificially introduced. Nevertheless, the teacher must deal 
systematically with the teaching of pronunciation, even though he may 
be forced to postpone the correction of some mistakes which occur in 
the early stages. In organizing his teaching he will require answers to 
such questions as: 
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What form of pronunciation is to be taken as model? 

What level of performance is to be aimed at? 

Can the difficulties of English pronunciation be ordered? 

What general principles should underlie the teaching of pronuncia- 
tion? 


12.2 The Choice of Models of Pronunciation 


This is a matter of special importance as far as English is concerned 
because of the world-wide use of the language and because of the 
profusion of differing spoken forms existing not only in such mother- 
tongue areas as Britain, North America and Australasia but also in 
those vast regions of India and Africa where English is used as an 
adopted lingua franca (see §6.3.5). 


12.2.1 The Native Speaker 


In normal circumstances, the Englishman or the American, for 
instance, will by early adulthood be making habitual use of one speech 
form determined mainly by his family background and by his social 
environment. Such variations in his pronunciation as occur are likely to 
be the result of differences in situation. It will no doubt be possible to 
identify the phonetic and phonological features which characterize his 
pronunciation when he is making a formal speech or when he is talking 
to children or when he is influenced by anger or tenderness, and so on. 
Such phonetic indices are of course accompanied by appropriate lexical 
and grammatical variations of usage. The pronunciation effects of 
stylistic variation are shown particularly in changes of tempo and 
intonation and in the use made of elision and assimilation (see § 12.7.3). 
In addition, some speakers will tend (often unwittingly) to adapt their 
own speech to some extent to that of their interlocutor, e.g. by making 
concessions to the other’s social or regional accent. Thus, in speaking to 
an American, an Englishman might use the pronunciations / skedju:], 
praimeroalhi/ for schedule, primarily in preference to his usual / fedju:1, 
‘praimorali/. Or again, even amongst his own countrymen he may feel 
that his usual /di'spju:t/ for dispute (n.) is inappropriate in the com- 
pany of those engaged in industrial strikes for whom /'dispju: t/ is the 
(recent) preferred form. 

However, although the native speaker generally has a basic stability 
in his habits of speech production, he has considerable ability in the 
reception and comprehension of other forms of English which show 
marked divergencies in their phonetic and phonological characteristics. 
There can be no doubt that such receptive efficiency has been brought 
about by half a century of aural exposure through radio and television 
in particular to most of the important spoken forms of the language. 
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Thus, the Londoner will now have little difficulty in understanding the 
commoner forms of American English, although he may experience 
some difficulty with the more extreme, popular types of, say, Glasgow 
or Belfast speech. 


12.2.2 The Foreign Learner 


It is rare for the foreign learner, usually acquiring English in an arti- 
ficial and intermittent fashion, to approach the native’s receptive and 
productive competence. Indeed, it can be argued that only excep- 
tionally is it necessary to have such an aim. But, whatever abilities the 
learner may acquire in the later stages of learning English, he will be 
well advised at the beginning to model his productive performance on 
but one type of spoken English, without any conscious attempt to alter 
his pronunciation according to style or situation in the way that the 
native speaker does, but restricting himself to what has been called a 
‘careful, colloquial’ style (see $11.5). As he gains confidence, pro- 
ductive precision and fluency in the single style, he can for the purposes 
of widening his receptive competence gradually be exposed to other 
important regional types. If his introduction to English is via a British 
spoken form, he might, when this production habit is firmly estab- 
lished, be given opportunities to gain experience in receiving a widely 
accepted form of American English such as may be heard from news- 
readers on the national networks (see $12.9.5). 


12.3 Choice of Basic Model 


Faced with the great diversity of English accents, the foreign learner 
might wish that there existed a neutral, all-purpose, international pro- 
nunciation of English. It is claimed by some that as a result of the great 
improvement in communications in this century the present divergen- 
cies in English as 1t is spoken throughout the world (see also $6.3.5) will 
gradually be eliminated and there will emerge a universally intelligible 
pronunciation of English which all can use. The present evidence 
suggests, if the total English-speaking population of the world is taken 
into account, that this natural and desirable process as yet makes no 
advance. If there is to be an acceptable international pronunciation of 
English in the foreseeable future, it seems likely that it will have to be 
artificially formulated and disseminated! (§12.8). There are some 
formidable difficulties which beset the acceptance of a solution of this 
kind, not least of which is the native speaker’s inability to use a type of 


! I have attempted in ‘Towards an international pronunciation of English’, pp 45- $3. 
In Honour of A. S. Hornby (Oxford University Press, 1978) to present some theoretieal 
examples of such artificial pronunciation systems for English. 
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hybrid English pronunciation which only partially resembles his own. 
The more realistic, immediate solution lies in the choice of one of the 
main natural forms of English as the basic model, e.g. a representative 
form of British or American pronunciation, with the possibility of the 
selection for geographical reasons of such other (numerically minor) 
forms as Australian or South African. The advantage of such choices is 
that these forms (in a standard variety) are without difficulty mutually 
intelligible. The decisive criteria in the choice of any teaching model 
must be that it has wide currency, is widely and readily understood, is 
adequately described in textbooks and has ample recorded material 
available for the learner. It is clear that, if these criteria are admitted, 
British RP is an important candidate as a basic model which has con- 
siderable prestige and is already taught throughout the world. 


12.4 A Wider-based RP 


The emergence of RP as a standard pronunciation has already been 
given some treatment (§6.3.2), together with a preliminary assessment 
of the present-day situation. It is not sufficient, however, for the 
foreign learner to be presented with a general account of British 
attitudes to a standard of pronunciation. He requires, on the one hand, 
advice as to which English speech form is appropriate for him and, on 
the other, wherever this is possible, guidance as to the acceptable 
tolerances within the form, 

RP must be regarded as an evolving mode of pronunciation in its 
phonological system, its phonetic realization and the incidence of its 
phonemes. Certainly the specification of RP as the property of a single 
social class within a restricted geographical location is no longer valid. ! 
It is now realistic to allow considerable dilution in the original concept 
of the RP speaker, with the consequent admission into the permitted 
speech forms of certain variants until recently regarded as regional. 
Nevertheless, the essential regional base (historically) of the pronuncia- 
tion regarded as ‘standard’ or ‘received’ remains the educated speech of 
the South East of England, as described in the preceding chapters. It is 
this system which is recommended in general terms to the foreign 
learner, though certain optional items such as /hw/ (§8.9.2) may be 
firmly excluded and the incidence of /a/ and /1/ less rigidly prescribed 
(see §12.7.1(b)). Where a choice is related to the usages of different 
generations, the learner is always advised to adopt the form commonly 
found amongst speakers of the middle generations, i.e. the type of RP 
referred to in earlier chapters as ‘general’. Similarly, the learner can be 
permitted some latitude in the phonetic realization of the RP 


! See the ‘Introduction’ to the English Pronouncing Dictionary (14th edition, Dent, 
1977) where I have sought to provide a less restrictive definition. 
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phonemes, especially as far as vowel articulations are concerned (see 
$12.7). The extent of permitted phonetic and phonemic latitude will 
depend upon the level of performance at which learners, with their 
differing needs, will be aiming. 

It can of course be claimed that the traditional concept of RP suffers 
such dilution as a result of the tolerances suggested that a new label 
should be applied to the model. ‘General British’ (GB) has been used! 
and may in time supersede the abbreviation RP. But so widespread in 
Britain and abroad is the use of the term RP that it is retained in this 
discussion. 


12.5 Performance Targets 


Factors which will obviously determine the learner’s aims are con- 
cerned with his age and his natural ability, and with his motivation and 
the use to which he intends to put the language. He may succeed merely 
in speaking English with the phonetic and phonological system of his 
own language, in which case he is likely to be totally unintelligible to 
most native English listeners or, at best, comprehensible only to the 
extent that a small number of information points can be decoded as a 
result of the general context of the situation. If an attempt is made to 
approximate to native English speech forms, the achievement may lie 
somewhere between two extremes. The lowest requirement can be 
described as one of minimum general intelligibility, 1.e. one which 
possesses a set of distinctive elements which correspond in some 
measure to the inventory of the RP phonemic system and which is 
capable of conveying a message efficiently from a native English 
listener’s standpoint, given that the context of the message is known 
and that the listener has had time to ‘tune in’ to the speaker’s pro- 
nunciation. At the other extreme, the learner may be said to achieve a 
performance of high acceptability, 1.e. a form of speech which the 
native listener may not identify as non-native, which conveys informa- 
tion as readily as would a native’s and which arrives at this result 
through precision in the phonetic (allophonic) realization of phonemes 
and by confident handling of accentual and intonational patterns. 

It must be admitted that the great majority of foreign learners will 
havea Severely practical purpose for acquiring English and will perceive 
no important benefit in approaching the performance of the native 
speaker. Indeed, the learner may derive some advantage in retaining a 
clearly foreign image in his dealings with English speakers, e.g. the 
waiter or the taxidriver, whose discourse with a native speaker is likely 
to be limited and stereotyped, is unlikely to need to progress beyond a 


! For instance, by J. Windsor Lewis in his Concise Pronouncing Dictionary of British 
and American English (Oxford University Press, 1972). 
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level of basic intelligibility. Too much should not be made of phonetic 
and phonological niceties if this is the goal. Such is the redundancy 
present in natural languages that, within a defined context and with an 
adequate stock of words and basic syntactic patterns, considerable 
intelligibility can be achieved even though the pronunciation is 
seriously deformed. (For instance, at a dinner table, someone who asks 
for the potatoes—pronounced as [ba de:do:z]—is likely to be under- 
stood without difficulty, though it should be noted that an accentually 
divergent form such as [ bode:do:z] might well perplex.) 

There will however be many learners who, for academic reasons or 
because their work requires them to deal on equal terms with native 
speakers in Britain or abroad or even as tourists, wish to communicate 
easily without signalling too blatantly their foreign origin. For them, a 
mastery of English pronunciation and a ready understanding of a 
variety of English accents will not be enough; they must also expect to 
have a comprehensive command of syntax, including everyday ellipti- 
cal structures, a wide-ranging vocabulary (containing new or fashion- 
able locutions) and a deep acquaintance with English culture. The 
foreign teacher of English constitutes a special case. He has the obliga- 
tion to present his students with as faithful a model of English pronun- 
ciation as is possible. In the first place, and particularly if he is dealing 
with young pupils, his students will imitate a bad pronunciation as 
exactly as they will a good one; and, secondly, if he is using illustrative 
recorded material, his own pronunciation must not diverge markedly 
from the native model. 

There remains one further and separate category of competence in 
pronunciation which seeks neither to imitate a natural model nor to 
have any international validity: 1t may be termed a level of restricted 
intelligibility. However undesirable such a level may be thought to be 
when the efficacy of English as a world language is in question, there is 
no doubt that for most of those who use English as a second language— 
as a lingua franca within their own country having a number of indig- 
enous languages none of which is acceptable as a national language— 
this is a reality, especially within the continents of Africa and India. 
Such forms of English conform in all important features of lexis and 
grammar to the native language of Britain or America and may thus in 
their written form pose no problems of international intelligibility. It is 
in the spoken form of transmission that phonetic and phonological 
interference from the indigenous languages may erect a formidable 
barrier for listeners from communities where English is a native lan- 
guage. If the interference is such that no attempt is made to do other 
than use the sound system and prosody of the indigenous language, 
however effective this may be within the country concerned, com- 
munication with native English speakers may break down. 
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12.6 Priorities 


The various elements of English pronunciation will offer differing 
degrees of difficulty according to the linguistic background of the 
learner. Teaching should obviously be concentrated on those features 
of English which are not found in the learner’s native language. There 
are, however, certain basic characteristics of pronunciation which seem 
sufficiently specific to English to constitute a priority for the great 
majority of learners. 


12.6.1 Accentuation 


The stress-timed rhythm of the English utterance (be it a single word or 
connected speech) with the related obscuration of weak syllables is the 
prime distinguishing feature of the language’s pronunciation, with no 
exact parallel in any other language. In fact, in connected speech, 
unaccented syllables considerably outnumber those carrying primary 
or secondary accents. For some learners, e.g. with French or an Italian 
language or an African tone language as background, the problems of 
English accentuation are especially great and will require prolonged 
emphasis. For all learners, accentuation must provide the foundation 
on which any pronunciation course is built. The great variety of 
accentual patterns in English words must, of course, be given an impor- 
tant and early place in any teaching programme. However, it is worth 
keeping in mind that, in ordinary connected speech, monosyllables 
account for more than 80% of words occurring, with words of two 
syllables coming next with 15%, three-syllable words with nearly 3% 
and longer words occurring comparatively rarely. 


12.6.2 Segmental systems 


(a) It should be noted that in running English speech vowels account 
for some 40% and consonants 60% of the phonemes uttered (§§7.4, 
8.10). The fact that RP /a/ and /1/ have a frequency of occurrence of 
10-74% and 8-33% respectively, whereas /91/ and /ua/ are compara- 
tively rare, may assume some Importance in the construction of drills. 
Similarly, it is noteworthy that among the consonants the alveolar 
phonemes occur most frequently, whereas /3/ is relatively rare. 
Although the fact that /0/ occurs ten times more frequently than its 
fortis counterpart /0/ can be accounted for by the frequent occurrence 
of such grammatical items as the or that (as pointed out in §8.10), the 
discrepancy may seem to indicate the relative weight which should be 
given to /0/ in teaching. On the other hand, /8/ occurs more frequently 
in accented (and therefore prominent) positions than /0/. 

(b) Whatever the relative frequency of actual occurrences of the 
items contained in the phoneme inventories, the full systems (20V and 
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24C) must be regarded as complex compared with the systems of many 
other languages. In particular, the opposition of the close vowels /i:/~ 
/1/, /u:/ N /u/, the existence of a central long vowel /3:/ and the 
delicately differentiated front vowel set of /1:/ N /1/ N /e/ N /e/ + 
/A/, together with the significant or conditioned variations of vowel 
length, will pose problems to many foreign learners. Similarly, in the 
consonant system, English has a comparatively rare complexity in its 
set Of fricative places of articulation—labio-dental, dental, alveolar, 
palato-alveolar and glottal ($8.4(1)). Again, although English shares 
with many languages a system of plosives of the sort /p, t, k, b, d, g/, 
the conventions of realization will not always be the same, English 
preferring presence or absence of aspiration as a crucial distinctive 
feature whereas many other languages, e.g. Romance or Slav, more 
typically make use of voicing ($8.2.11). Finally, as far as consonants are 
concerned, English characteristically permits a large number of con- 
sonant clusters ($9.8). Consonant clusters occur, of course, in many 
other languages, e.g. German and Polish, but the combinations per- 
mitted may differ from those of English. There are nevertheless other 
languages whose possibilities of consonant clustering are much more 
limited, e.g. Spanish or Italian, or whose syllables regularly have a 
simple CV shape, e.g. many Oriental and African languages. It is clear 
that in such cases much practice in English two- and three-consonant 
clusters will be needed to avoid the insertion of intrusive vowels—not 
only initially and finally in the word (where clustering as opposed to 
single consonants has only a 20% occurrence) but also across word and 
morpheme boundaries. 

(c) The choice of phonemes must, of course, coincide with that of 
native speakers using the same type of pronunciation. It has already 
been pointed out (§9.10) that there exists in many cases a free choice of 
phonemes without a denotion of a particular generation or style, e.g. 
/e/ or /e1/ in again; /i:/ or /e/ in Kenya, zebra; /3:/ or /D/ in salt; /u/ 
or /u:/ in room; /sj/ or /f/ in issue; /g/ or /d3/ for the last consonant 
in pedagogy, etc. There are other choices ($12.7.3) which may be deter- 
mined by considerations of generation or style. In all cases of doubt, the 
learner is advised to consult a dictionary! which gives several RP 
variants and to use the variant which, having some advantage in 
frequency of occurrence, is placed first. 


12.6.3 Sounds in context 


English has its own specific habits as regards both assimilation and 
elision (§§9.6, 11.3, 11.3.7), but how far these are adopted by the 
foreign learner will depend on the performance target which he has set 


! The English Pronouncing Dictionary, (14th edition, Dent, 1988) is the most compre- 
hensive for British English. 
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himself (§12.7.3). Particularly affected are the alveolar articulations 
which exhibit notable instability. 


12.6.4 Intonation 


Although nearly all non-tone languages share certain characteristics of 
intonation, e.g. a tendency to use generally falling tunes for declarative 
statements and commands, whereas ultimately rising patterns are often 
associated with questions and non-finality, there are some tunes (as well 
as certain usages as regards the pitch of unaccented syllables) which are 
specific to English. It should be remembered that intonation makes a 
most important contribution to the accentuation patterning of English 
(pitch variation serving to make salient those information points to 
which the speaker wishes to draw attention). To that extent, intonation 
may be taken to fall within the priority category (12.6.1) above. In 
addition, the attitudinal implications of intonation become increas- 
ingly important if a high level of achievement is desired. However, in 
the early stages of learning English, although the intonation patterns 
presented by the teacher should always be authentic, attention should 
be concentrated on the role of pitch in accentual oppositions, especially 
as regards the location in the utterance of pitch-prominent syllables. 
Once this basic level is achieved, variations in tone shape can be 
introduced. 


12.7 RP High Acceptability < 


If high acceptability in RP is defined as a level of attainment in 
production which, for the native listener, is as readily intelligible as that 
of a native RP speaker and which is not immediately identifiable as 
foreign, and as a level of receptive ability which allows the foreign 
listener to understand without difficulty all varieties and styles of RP as 
well as the other important forms of English, it is to be expected that the 
learner will be familiar with the features of current English described in 
Chapters 7—11. His productive performance must be related to one 
specific type of RP. However, just as no two RP speakers of the same 
generation (nor the same RP speaker on different occasions) speak in 
precisely the same way, the learner may be permitted certain tolerances 
in his pronunciation, provided that his speech retains an internal 
consistency, 1.e. avoids the incongruity of a mixture of regional or 
generation styles. Some native speakers are consciously apt to adopt 
unhabitual speech forms, e.g. by attempting to use an American 
pronunciation, when they hope to create a special effect in a particular 
situation. The foreign learner, however proficient he has become, is 
well-advised not to attempt such an effect, in the same way that he 
Should be wary in his use of slang or non-standard grammar. The 
latitude which can be admitted concerns rather the phonetic realization 
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of the segmental phonemes, i.e. the choice of allophonic variants (and 
in particular those of the vowels). In the following sections, the impor- 
tant essentials are listed and suggestions made as to possible tolerances 
compatible with a goal of high acceptability. 


12.7.1 Vowels 


(a) Essentials As mentioned in §12.6.2(b), the learner must have at 
his disposal the 20 vowel phonemes (12 monophthongs and 8 diph- 
thongs), together with the appropriate durational variations (§7.4), 
especially when these latter are significant in such oppositions as seed ~ 
seat, heard N hurt, road ~ wrote, etc. Special attention must be paid to 
the quality/quantity complexes involved in such series as bead, beat, bit 
(§7.9.2(5)) or food, boot, good (87.9.9(5)), these items illustrating the 
occurrence respectively of long tense /1:/, reduced tense /i:/ = [i], 
short lax /i/ and long tense /u:/, reduced tense /u:/ = [u], short lax 
/U/. Although oppositions of only two phonemes are involved here, it 
is important in teaching to identify the three phonetic complexes so as 
to ensure that the vowels of, for instance, beat and boot are not given 
undue length. | 

(b) Tolerances Provided that the oppositions are retained, 1.e. that 
the realizations of phonemes in the same context do not overlap, some 
latitude in performance is possible. Thus, for instance, in the series /1:, 
1, e, 2/, the qualities produced may be somewhat more open than those 
indicated by the mark ® in the diagrams of Chapter 7. Permissible areas 
(as opposed to points) of production encompassing several of the 
variants given in Chapter 7, are shown in Fig. 46. 

/1:/ and /u:/, which are usually pronounced in General RP with a 
slight glide from a more open, centralized position to a closer, more 
fronted position, may be given a pure vowel articulation with a close 
front or back position respectively, similar to the qualities commonly 
associated with vowels of this type in other languages. Though there is a 
danger that such pronunciations may sound hypercorrect to the native 
listener, especially if the vowels used are cardinal in quality, they are to 
be preferred to gliding vowels whose starting point is so open that it falls 
within the area typical of popular regional speech—thus introducing 
unwittingly a comic incongruity if the remainder of the system con- 
forms to the RP model. 

The theoretical areas of tolerance for /a:/ and /2:/ are extensive, the 
limitations being imposed by the presence of /#/ = [æ:], e.g. in an 
Opposition such as cad ~ card, and of /u:/, e.g. in food ~ ford. The 
variations shown in §§7.9.6, 7.9.8 may be permitted. Similarly, the 
variants for /3:/ shown in $7.9.11 may be safely used, provided that 
they do not overlap with the realization of /a:/, e.g. if an open variety 
of /3:/ is used in a word such as burn, this will necessitate a more 
retracted /a:/ in barn. 
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FiG. 46.—RP monophthongs: acceptable areas. 


The diphthongization sometimes associated with RP /1, e, æ/ men- 
tioned in Chapter 7 seems to be growing less common and need not be 
adopted. However, the increasing use of a close vowel (1.e. a short /1:/ 
for final /1/) amongst middle and young generation speakers justifies 
its use by foreign learners, if it is found to be easier than the traditional 
/1/,1n such cases as happy, easy, etc. In the case of final /3/, there is no 
need for a specially open variant to be used (§7.9.12(2)), an excessively 
open variant often being judged comic. 

As stated in §7.9.2, there is often free variation in the use of /a/ or /1/ 
in some unaccented syllables. The pronunciation /a/ appears to be 
increasingly preferred, e.g. for the penultimate vowel in the termina- 
tion -ity in such as word as quality (/-ati/ rather than /-1ti/), -ily 
(especially after /r/) as in merrily (/-ali/ rather than /-1l1/), -ate as in 
fortunate (/-at/ rather than /-1t/). Again, /3/ or /1/ are equally accept- 
able in the terminations -/ess, -ness, -ace, e.g. hopeless, goodness, 
palace, with /a/ gaining ground; but it still seems preferable to retain 
/1/ in -et as in carpet and essential to have /1/ in -age as in cabbage, as 
well as in the inflexion -es, -ed as in horses, waited. 

As for the 8 RP diphthongs, the area of the first element is more 
important than the second element, which is only lightly touched on 
and has little prominence. These areas of onset may acceptably be 
extensive but should not overlap (see Figs. 47, 48). The regional 
connotations of the quality of the starting points (as described in 
$87.10—7.12) should be borne in mind: if, for instance, /e1/ is said with 
a starting point more open than cardinal [e], the impression given may 
be that of popular London speech. 
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al, au 


FIG. 47.—RP falling diphthongs: acceptable onset areas. 





FIG. 48.—RP centring diphthongs: acceptable onset areas. 


In the case of /al, au/, there may bea common (open central) starting 
point (§§7.10.2, 7.10.5); the levelling of /a1ə, auə/, though it has long 
been common in Southern England, is not essential for the foreign 
learner. It should also be noted that /ua/ is increasingly replaced by 
/9:/ not only in such words as poor, sure, but also in moor, tour, etc. 
(§7.12.3(3)). As for /aU/ (§7.10.4), although the general RP form [au] is 
most widespread and should thus be recommended, the more conserva- 
tive form with a somewhat more retracted first element [o] is not 
inadmissible; but learners are advised to avoid the diphthong having a 
fronted onset [ë], this variant now being regarded as excessively 
affected. 
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12.7.2 Consonants 


(a) Essentials The full inventory of 24 consonant phonemes must be 
available to the speaker. 

At this high level of performance, it is essential that the aspiration of 
fortis /p, t, k/ in accented positions (§8.2.1) should be maintained as 
the major factor distinguishing these phonemes from lenis /b, d, g/, 
i.e. constituting a more potent differentiating feature than presence or 
absence of voicing. Learners whose mother tongue relies on voicing as 
the prime feature of opposition (e.g. most Romance and Slav language 
speakers) must take particular care. If, for instance, pat is realized as 
[peet] rather than [pet], an English listener is likely to understand bat, 
the absence of aspiration suggesting /b/ to an English ear. Similarly, 
/1/ and /r/ following accented /p, t, k/ (and to a lesser extent /j/ and 
/w/ since oppositions involving preceding fortis/lenis plosives are rare) 
must be devoiced, as if thus signalling aspiration, so that plead and 
bleed are distinguished primarily by [l] in plead or pray and bray 
primarily by [1] in pray. 

The articulation of /t, d/ must be clearly alveolar ($8.2.5(5)), and 
when followed by the homorganic syllabics /n/ and /l/, as in button, 
sudden, little, middle, the release must be nasal or lateral respectively 
without an intrusive vowel (§8.2.3(4, 5)). 

It is also characteristic of highly acceptable RP that the first plosive 
of stop sequences should have no audible release (§8.2.3(2)), e.g. in 
actor, black tie, rugby, big dog. Intrusive vowels should be avoided in 
clusters of other consonants, e.g. between /s/ and /p, t, k/ as in sport, 
strike, school or in such final sequences as /lmz/ (films) or /Inz/ 
(kilns). 

Alveolar /s, z/ must remain clearly distinct from dental /6, 0/ and 
palato-alveolar /f, 3/ (88.4). 

The /1/ phonemeshould have the qualities and correct distribution of 
allophones mentioned in §8.7.1, i.e. [l, t, 1]. Similarly, /r/ should have 
a post-alveolar approximant or frictionless continuant articulation 
rather than any kind of trill ($8.7.1). High performance RP will make 
use of the linking /r/,e.g. for far away /'fa:r a'we1/ is more typical than 
/'fa: a we1/; and for pour out /'pa: r 'aut/ is preferred to /'pa: ‘aut/. On 
the other hand, ‘intrusive’ /r/, asin vanilla /r/ ice, is best avoided by all 
but the most competent, lest the usage be extended to those other cases, 
e.g. I saw /r/ it, draw /r/ ing room, which are often stigmatized by RP 
speakers (§11.3.7). 

(b) Tolerances Certain of the variants or allophones described in 
Chapter 8 may be disregarded without detriment to an impression of 
highly acceptable performance. For instance, the devoicing of lenis 
consonants /b, d, g, d3, v, 0, z, 3/ In post- or pre-pausal position is not 
crucial: full voicing is permissible provided that a [a] off-glide is not 
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added finally to a word such as big (['big?]) so that it is decoded as 
bigger. 

Final (1.e. pre-pausal) plosives, although usually without audible 
release, as in /it, lid, may be given an explosive last stage, the resultant 
impression being one of careful speech. The use of [?] (§8.2.7) or glottal 
reinforcement (§8.2.3(3)) in connection with the fortis plosives and 
affricate is never necessary; nor is the affrication or weakening of 
plosives mentioned in §8.2.3(6). 

Although, as mentioned in previous sections, it is important to gain 
articulatory skill in producing the numerous RP consonant clusters, it 
should be remembered that sequences of more than two consonants 
with simple words are comparatively rare; a greater test of articulatory 
skill occurs at morpheme and word boundaries, e.g. in corkscrew, fixed 
price, pitched battle, sobbed bitter, etc. 


12.7.3 Sounds in context 


(a) Elision The examples of post-nuclear vowel elision in the words 
quoted in $9.6(1, b), e.g. loss of /a/ in comparable, factory, dangerous, 
carefully, etc., must be regarded as optional, the retention of the weak 
vowel being entirely acceptable. In pre-nuclear positions, e.g. in polite, 
solicitor, the foreign learner is best advised to avoid elision of the weak 
vowel, a practice which is characteristic of rapid, casual speech. 

On the other hand, the alveolar consonant elisions mentioned in 
§9.6(2, b) can be adopted by the foreign learner, especially those 
involving the simplification of a three consonant cluster by the elision 
of a medial /t/ or /d/, e.g. in restless, kindness, where retention of the 
alveolar stop can be regarded as unusually careful. 

In connected speech, considerable elision of both vowels and con- 
sonants {s characteristic of a colloquial style (§11.3.6). It is not neces- 
sary, however, for the foreign learner (even with a high acceptability 
target) to seek to adopt all these native speaker habits. Once again, he 
may restrict himself to the elision of alveolar stops occurring medially 
in clusters of three consonants (especially when /t/ or /d/ is followed 
by another stop) brought together at word boundaries, e.g. in left turn, 
mashed potatoes, jogged by, wind down, etc. 

(b) Assimilation It is assumed that the competent foreign learner 
will make most of the allophonic contextual assimilations mentioned in 
§11.3.1, these usually resulting from the inevitable accommodation 
which takes place between two articulations in the speech continuum. 
Certain changes which may provide cues to meaning, e.g. the devoicing 
of /1/ and /r/ after accented /p, t, k/ mentioned in 12.7.2, (a) above, 
are essential. In the case of assimilations involving changes of phoneme 
(§§11.3.2—5), especially at word boundaries, it should be remembered 
that these are important because they have stylistic implications. Thus, 
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if a native speaker elides medial /t/ and /d/ in consonant clusters, it is 
likely that he will also assimilate word final /t, d, n, s, z/ at word 
boundaries as suggested in §11.3.5. It is therefore consistent for the 
highly competent foreign learner to adopt similar habits. He must 
always be careful that in seeking to adopt an informal, colloquial style 
of English speech he does not introduce assimilatory habits which are 
characteristic of his own native language but not of English (§11.5). 

However, what is more important than the development of pro- 
ductive skills in either elision or assimilation is the cultivation of the 
ability to receive and decode the highly elided and assimilated speech 
produced by native English speakers. Such speech must be regarded as 
the norm in all ordinary circumstances. The foreign student must there- 
fore spend aconsiderable amount of time listening to normal colloquial 
English in order to acquire skill in identifying information points in an 
utterance (making full use of all contextual and syntactic cues and prob- 
abilities), when apparently crucial sounds are absent or when the isolate 
form of a word has become greatly changed in context, e.g. /fwi/ is a 
frequent telescoped form of /falwi/ (shall we). The native speaker is, of 
course, usually unaware that he makes reductions of this sort. 


12.7.4 Accentuation 


It has already been pointed out (§12.6.1) that the accentual features of 
English must form the base for the teaching of the language’s pronun- 
ciation. These include the correct reduction of unaccented grammatical 
items (weak forms—§10.4(2)). In connected speech, the weak forms of 
the most common grammatical items occur overwhelmingly more fre- 
quently than the strong forms, and, from this point of view, may be 
considered the usual pronunciation. However, if a native speaker is 
asked to pronounce such common words as and, but, at, of, etc. he will 
naturally give the rare citation (accented) forms /zend, bat, zt, pv/ 
rather than the more common /ond, bat, ot, ov/. He is genuinely 
unaware that he regularly reduces such words when unaccented in con- 
nected speech and will often deny that he would normally pronounce a 
phrase such as And there was a knock at the door as /an 02a woz 2 ‘nok at 
Oa ‘do:/, describing such a speech style as careless. It follows that it is 
wiser to listen to the way in which the native speaks rather than to ask 
his opinion. Similarly, most dictionaries understandably record the 
strong pronunciation of these grammatical words, without always 
giving the weak alternatives. The foreign learner—at any level—must 
regard the strong forms as being ‘marked’, 1.e. having a special 
meaning compared with the ‘unmarked’ sense of the usual weak forms: 
thus, the oppositive, emphatic meaning of / ‘can /ken/ come, Pm 
going ‘to /tu:/ London with associated stress accent may be compared 
with the same phrases said neutrally with /kon/ and /ta/. However, 
the uncommon reduced forms, heard only in very rapid speech and of 
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which some examples are given in the last part of §10.4(2), need not be 
imitated by foreign learners, even those with the highest target. The use 
of /ja/ or /ma/ in such phrases as your mother, my father has social 
connotations and, if employed inappropriately by a learner, could 
appear comically incongruous. 

The proficient learner will, however, make use of those variations in 
the accentual patterns of words which are the result of the general 
rhythmic pressures of extended speech as opposed to the patterns of 
isolate words ($11.2). Such variation affects particularly adjectives or 
the adjectival element of a noun phrase, the predicative pattern (cor- 
responding to the isolate form given in dictionaries) being regarded as 
normal, whereas the attributive pattern displays the variant, e.g. 
compare the patterns in citation or predicative situations as in My car’s 
secondhand [,--'-], This is Waterloo {,--'-] and the attributive variants A 
secondhand ['---] car, Waterloo ['---] Station. Such variation, ‘stress- 
shift’, is essential for high acceptability. 


12.7.5 Intonation 


The intonation of English has been studied in greater detail and for 
longer than that of any other language. No definitive analysis, classi- 
fying the features of RP intonation, has yet appeared (though that 
presented by O’Connor and Arnold (1973) provides the most com- 
prehensive and useful account from the foreign learner’s point of view). 
Difficulties arise not so much in the accentuation role of intonation, i.e. 
the use of pitch as a prime factor in making a syllable or word promi- 
nent amongst its neighbours, or in the association of tone patterns with 
general grammatical structure ($10.5.1ff). It is rather in the assignment 
of emotional (attitudinal) colouring to one or other tone pattern that 
disagreement still exists, even when discussion is restricted to a single 
type of pronunciation such as RP. One reason for such disagreement is 
that there cannot be said to be a simple one-to-one correlation between 
a particular tone pattern and a particular attitude, the tone pattern 
which manifests an attitude depending very much upon the intona- 
tional environment; moreover, the attitude itself is not easily defined in 
discrete terms. Thus, not all speakers would agree upon what con- 
stitutes a perfunctory or detached attitude in making a statement nor 
upon which tune should be chosen to signal such an attitude. Similarly, 
a listener may misinterpret a speaker’s attitude because his (the 
listener’s) intonation habits are somewhat different, although the two 
segmental systems do not differ. 

Nevertheless, there is sufficient common ground in the correlation of 
attitude and tone pattern for the highly competent student to be confi- 
dent in making use of this other function of intonation, in addition, of 
course, to the essential role of signalling the location of nucleus and 
accented syllables. He should be skilled, for instance, in the use of the 
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falling-rising pattern (§10.5.2.3), occurring both on a single word and 
spread over two or more words. This pattern is extremely common in 
RP and is important for conveying many nuances of doubt, encourage- 
ment, unspoken implications, etc., which in other languages might 
require more explicit lexical or syntactic cues. The learner should also 
note the frequent use of emphatic falls on pre-nuclear accented syllables 
(capable also of being identified as multi-nuclear patterns as in 
§10.5.2.4) which are a common feature of natural discourse—as 
opposed to much reading aloud. 

Despite the encouragement of many textbooks t to use rising intona- 
tion patterns for Yes/No questions and falling patterns for X-questions 
(i.e. those containing an interrogative word), the learner should note 
that a rising intonation is at least as common for the latter type, 
especially when the enquiry is polite or appealing. The intonation for 
question-tags should also be correctly used: a falling tune expecting 
confirmation of a statement (e.g. ‘isn’t it?), whereas a rising tune 
indicates a request for information (e.g. ,isn’t it?). 

Finally, the learner aiming at the highest achievement should be 
aware of the attitudinal importance of the general pitch level of the 
intonation head and body preceding the nucleus, a high level usually 
indicating enthusiasm, Interest, liveliness compared with a low level 
which may give an impression of a perfunctory, uninterested, surly or 
suspicious attitude. 


12.8 Minimum General Intelligibility 


This type of performance has been described in §12.5 as one which 
preserves the chief elements of the RP system and is capable of con- 
veying a message with some ease (in a given context) to a native English 
listener. It is not necessarily related specifically to any one form of 
English. It is regarded as essential that the accentual characteristics of 
English (including rhythmic features and the associated obscuration 
of weak syllables') should be retained, as well as the ability to produce 
the common consonant clusters. It 1s in fact possible to reduce the 
segmental inventory of English very considerably and still retain a good 
level of intelligibility.” For instance, the vowel system can be reduced to 
a central pair /9:/ and /a/. If this is done the sentence /'won da jo Bak 
da tfaldran wal gat ‘ha:m fram 'ska:1?/ is not too difficult to interpret as 
“When do you think the children will get home from school?’, especially 
if the situation predisposes the listener to understand ‘children’ and 
‘school’. The disadvantage of such a reduction is that it produces a 
form of English unlike any natural speech, with the attendant problem 
that native speakers would have difficulty in teaching it. 


l Minimally reduced forms can be employed if they are thought to be easier, e.g. /and 
for and rather than /an/, /nd/ or /n/; he, his, etc. can retain the /h/. 
-See In Honour of A. S. Hornby, op. cit. ; 
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It follows that any simplified form of pronunciation should have 
features with most of the major natural forms. In addition, the model 
should have three requisites: — 


1. It should be at least as easy for the foreign student to learn as any 
natural model. 

2. It should be readily intelligible to most native speakers of English. 

3. It should provide a base for the learner who has acquired it to 
understand the major natural varieties of English. 


Given that the accentual features common to all natural forms of 
English must be retained, it is in the segmental phonemes that sim- 
plification may be expected. An amalgam of British RP and a generally 
accepted form of American pronunciation would seem an ideal 
solution, representing the great majority of native English speakers. 
The result would also show a more obvious affinity to the orthography 
of the language than does RP. (It is undoubtedly true that English 
orthography, though often maligned for its apparent lack of cor- 
relation with present pronunciation, does exhibit underlying regular- 
ities applicable to a number of forms of English.) The preservation of 
the pronunciation of post-vocalic /r/ in words like farm, heard is an 
example of closer adherence to orthography which may in fact reflecta 
majority pronunciation and which might be helpful to some learners 
who do not wish to identify their English pronunciation with any one 
accent. However, apart from this potential presence of post-vocalic /r/ 
(and the recommended omission from the system of /hw/), the major 
forms of English exhibit considerable homogeneity in their consonant 
systems, which offer no further possibilities of simplification if they are 
to retain some resemblance to a natural system. 


12.8.1 Vowels 


The vowel system may be simplified in both phonemic and phonetic 
respects, while still keeping an acceptable level of intelligibility. 

(a) The centring diphthongs /1a, ea, Ua/ (87.12.1), for instance, are 
of comparatively rare occurrence (1.5% of vowels; 0-7% of all 
phonemes in connected speech). They can always be interpreted as V + 
/r/ when appropriate, e.g. peer as /p1:r/; pair as /peir/; and poor as 
/pu:r/ (though in the latter case it would be more economical to replace 
RP /ua/ by 2:/). Where there is no r in the spelling, e.g. in idea, skua, 
M:/0r /1/ + /a/ and /u:/ or /U/ + /a/ would be used. 

(b) Similarly, many cases of long vowels can be made to conform to 
the orthography by the retention of post-vocalic /r/, farm, four, heard 
might well have V + /r/, though such words as calm and saw would 
retain their /r/-less form. The unique case of /3: / without rin colonel 
could conform to V + /r/ by analogy, as often in American English. 

(c) Finally, it would doubtless be simpler for many foreign learners if 
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certain phonetic adjustments in the realization of vowel phonemes were 
made, by using Cardinal Vowel values (§4.4.2) wherever possible. 
Thus, /1:, uz, 9:/ would have cardinal values; /e1/ and /aU/ would be 
realized as long cardinal [e] and [o] (as they are in many forms of 
English); /e/ and /æ/ would be pronounced as cardinal [e] and [a]; 
/a: / would be long cardinal [a]; /A/ and /3/ would have the quality of 
accented and unaccented mid central [a], with a long counterpart of 
similar quality; /1, U, 0/ would remain as now; /ai/ and /au/ would 
have a similar (front) starting point. The resulting 16 vowel inventory 
would be as follows: — 


6 short monophthongs: /1/ (sit); /e/ (set); /a/ (sat); /D/ (sot); /u/ 
(soot); /a/ (another) /a nadar/ 

7 long monophthongs: /1:/ (bee, beer); /e:/ (pay, pair); /a:/ (calm, 
car); /9:/ (paw, pour, poor); /0:/ (home); /u:/ (do, dour); /3:/ 
(bird) 

3 diphthongs: /a1/ (buy); /au/ (cow); /91/ (boy) 


(The vowel relationships are shown in the diagrams given in Figs. 49, 
50.) 

The following transcription demonstrates the use of such a vowel 
system with a full RP consonant inventory:— 


al 'do:nt ju: 3ol1 kom ‘hitr bai 'ka:r—Ó9 'tre:nz a ‘gud 'di:l mo:r 
kon'virnjant wen wi:r ‘spendin ə fju: 'avarz in 'taun wid da ‘boiz. 
ju: ‘haf to bi: 'so 'a:rli if ju: ‘wont to 'faind a 'pa:rkim ¡ple:s. al 
‘Oink 'mo:st 'gara:d31z 'tfa:rd3 'fazr tu: 'motf for ‘purr 'fo:k laik 
as. On 'rezlfezrz 'tfi:poar—so0: 'fal wi: 'mi:t at da 'ste: fən?! 





FIG. 49.—Monophthongs: simplified system. 


! In orthography: ‘I don’t usually come here by car—the train’s a good deal more 
convenient when we're spending a few hours in town with the boys. You have to be so 
early if you want to find a parking place. I think most garages charge far too much for 
poor folk like us. The rail fare’s cheaper—so shall we meet at the station?’ 
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ZER j 


FIG. 50.—Diphthongs: simplified system. 


Note.—It is also conceivable, if minimum intelligibility is the target, 
that the variations of vowel length (e.g. the significant reduction of /i: / 
in feet as opposed to the same vowel in feed) might be disregarded—the 
fully long variety being used in all contexts. The loss of such an impor- 
tant cue to meaning throws even greater weight on to the sense-defining 
function of the context. 


12.8.2 Consonants 


As far as consonants are concerned, it is important that all phonemic 
oppositions should be maintained, though the optional /hw/ (§8.9.1(4, 
5)) may be omitted. Nevertheless, certain phonetic tolerances may be 
permitted: — 

(a) In the phonemic series /p, t, k, b, d, g/ (where presence or 
absence of aspiration is typically significant in initial accented positions 
in most native forms of English) voicing may be used as the distinguish- 
ing cue, if only the most basic type of intelligibility is the aim: thus, pie 
would be distinguished from buy by the presence of voice in /b/ and its 
absence in /p/ (as is the case in the plosive systems of many languages 
such as those of the Romance and Slav families). Again, the loss of such 
an important cue implies great dependence on the context for the 
definition of meaning. 

Less communicative harm is done if the articulation point of /t, d/ is 
dental rather than alveolar. Most languages use a dental contact point 
for this type of stop, and to do so in English entails no loss of intelli- 
gibility, though the phonetic deviation is usually immediately apparent 
to an English ear. It is much less permissible (from the point of view of 
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communication) to replace, as many speakers of Indian languages tend 
to do, /t, d/ by retroflex [t, d] and /8, 6/ by dental aspirated [t", d*]. 

(b) The sound [yn] may lose its phonemic status, in the sense that, 
besides occurring regularly before /k/ (as in think), it will retain a 
following voiced velar plosive element elsewhere, e.g. sing, singer [sing, 
'sijgo], where RP omits [g]. [9] is then a conditioned variant of /n/ 
before velar plosives. Thus, hanger and anger with /n/ and /ng/ 
respectively in RP may both have [ng] if found to be easier at the 
rudimentary level of achievement. Such a usage corresponds to that of 
several British English regional forms (§8.6.2). No two English words 
are distinguished in meaning by the opposition of /n/ and /ng/. 

(c) The complex English fricative system must be retained, but some 
relaxation of the articulation of the dental /0/ is allowable in weak 
positions ($8.4.3(3)), especially in the case of the following /s/ or /z/, 
e.g. in “What's the time?’ or ‘Is there any left?’. In such positions, it is 
difficult in rapid, casual speech to articulate with any precision a truly 
dental sound immediately after an alveolar one. The English often 
replace it with a dentalized [z]; if a foreign learner uses a [z] sound in 
such positions of weak prominence, it may be hardly noticeable. 

(d) The complex realizations of RP /1/ (§8.7) may be simplified; in 
particular, the ‘clear’ allophone may be used (as in some forms of Irish 
English) where the ‘dark’ [t] is normal in RP, without loss of intelli- 
gibility. The devoiced [l] allophone following initial accented /p, k/, on 
the other hand, is an important carrier of meaning in an opposition 
such plot a blot. It follows that, if voicing rather than aspiration is used 
as the distinguishing feature of /p, k/ v. /b, g/ as suggested in (a) 
above, the devoicing of /1/ after /'p, k/ is less likely to occur; if this 
type of pronunciation is adopted, the listener’s decoding of the message 
may again have to rely very heavily on the general context. 

(e) There is little disturbance to comprehension if /r/ (§8.7.1) is 
realized as an alveolar tap or lingual roll (even in a post-vocalic 
position) rather than as the more typical RP post-alveolar frictionless 
continuant or approximant, since these variants occur in several British 
regional forms of speech. A uvular articulation, however, though inter- 
fering little with intelligibility once the listener has adjusted and though 
still occurring in the speech of some speakers in North-Eastern 
England, is always perceived as unusual. As with /1/ in (d) above, the 
devoiced allophone of /r/, the main cue distinguishing pray from bray, 
may be dispensed with, thus forcing the context to carry a heavy burden 
of information. There is no loss of intelligibility if the linking /r/ is not 
used (§§11.3.7, 12.7.2(a)). 

(f) Although the use of syllabic /n, 1/ is typical of RP, intelligibility 
does not suffer if /9/ is inserted even after an alveolar consonant (cf. 
§12.7.2(a)), as in mutton, hidden, fasten, little, middle, etc. Following 
/t, d/, however, the insertion of /3/ before /n, 1/ will be perceived as 
strikingly unusual—and even childish. 
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12.8.3 Intonation 


Since correct rhythm and accentuation (with the associated reduction 
of weak syllables) are regarded as indispensable for any level of spoken 
English (§12.6.1), all learners should be proficient in making use of 
pitch movements as important cues for signalling salient words or 
syllables. Those learners whose own language uses pitch variation for 
grammatical rather than for lexical (tone) purposes are likely to be able 
to transfer much of their own intonation usage into English without 
serious disturbance of intelligibility. (If, however, they ignore the 
specifically English attitudinal uses of intonation, they will run the risk 
of giving an unintended impression of, for instance, abruptness or 
boredom!) On the other hand, those learners whose mother tongue is a 
tone language may find the concept of intonation’s functions as they 
occur in English and other European languages entirely novel. They 
must master, in addition to the essential pitch features related to accen- 
tuation, the basic intonation patterns which signal grammatical cate- 
gories, e.g. the typically falling intonation for statements and the rising 
tunes so often used in questions. This is particularly important when, 
for instance, the native language makes use of such devices as particles 
for signalling questions, without any significant pitch variation. 


12.8.4 Sounds in Context 


At these lower levels of performance no attempt need be made to 
imitate native English habits of elision or assimilation. 

The result of permitting all the concessions mentioned in $12.8, 
involving the acceptance of an amalgam of segmental and prosodic 
features which occur together in no one natural form of English, is 
likely to bea style of speech which, though sounding markedly artificial 
or foreign, will be generally intelligible to most native English speakers. 
The native listener will need to adjust his decoding habits in much the 
same way that he does when listening to a native speaker using a 
regional accent of English which differs considerably from his own. 
This rudimentary form of pronunciation is not, however, to be 
regarded as a teaching target, but rather as a minimum level of achieve- 
ment consistent with some degree of intelligibility; severe problems of 
communication will be caused by any greater divergence from a natural 
model. A serious learner with the most modest aims, working with or 
without a teacher, should seek to match his pronunciation with a 
natural type of English. 


12.9 Teaching Methods 


Learners’ situations and requirements present so many variables that it 
is difficult to give advice of general applicability. Is the learner a child 
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or an adult? Is he learning in a class with a teacher or by himself with the 
help of recordings or broadcast lessons? Is he to learn English pro- 
nunciation as part of a course extending perhaps over several years or is 
he attempting to acquire the essentials in a much more limited time? 
Whatever the answers to these and other questions, it must be accepted 
that in nearly all cases he will be seeking to learn English in an artificial 
fashion, i.e. very differently from the natural way in which he acquired 
his mother tongue, with constant exposure to the language of his family 
environment and with strong (instinctive) motivation to learn an 
efficient verbal means of communication. Nevertheless, it is some five 
years before he attains proficiency in the basic skills of speech produc- 
tion and reception. 

What is clear is that, in teaching pronunciation, we are concerned 
especially with imparting motor and auditory skills rather than with 
inculcating the kind of logical agility such as may be involved in the 
acquisition of anew grammar. It is true that the learner must assimilate 
rules for the distribution of allophones or for the assignment of intona- 
tion patterns to the appropriate grammatical or attitudinal context; he 
may even learn to apply rules for the derivation of word accentuation 
patterns from the orthography (most usefully with respect to the 
relation of the word accent and suffixation). But his knowledge of the 
rules is of little value if he is unable to transmit a rule in efficiently arti- 
culated speech. 

Since it is generally the case that the acquisition of a second 
language’s pronunciation becomes increasingly difficult after early 
adolescence, it is obvious that ideally it is desirable to teach pronuncia- 
tion as soon as possible. With children it is often the case that the ability 
to mimic is retained to such a high degree that a demonstration of some 
feature of pronunciation, without further explanation, is sufficient 
(thus emphasizing how especially important it is that a teacher’s own 
pronunciation should reach the highest level possible). When this 
ability is no longer present, as 1s the case with most adult learners, every 
effort must be made to overcome the interference from the sound 
system of the first language, through which the new, foreign (English) 
sounds are being filtered. Before attempting to produce English 
sounds, accentual or intonation patterns, it is invariably advisable to 
teach and establish certain basic discriminatory skills: the learner must 
be able (a) to distinguish with certainty between features of his own 
language and those of English; (b) to distinguish between the contras- 
tive features of English. 

The acquisition of such new auditory skills can be achieved by the use 
of extensive discrimination drills in which the learner is required to 
judge identities and differences in stimuli presented to him by a teacher 
or in a recording. Such “ear training’ is, of course, particularly impor- 
tant for those who aim at the higher levels of achievement in pronuncia- 
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tion where phonetic precision is essential. It is, however, not generally 
useful to expose a learner to a tape in a language laboratory without any 
preliminary instruction on the features which he is expected to perceive. 
The recorded material should be graded so that the discrimination tasks 
become progressively finer. The following sections illustrate possible 
procedures: ! 


12.9.1 Production: Vowels 


Many learners of English have difficulty with the short vowel series 
exemplified by pit, pet, pat, putt, pot, put. The series should first be 
presented to them, either in words having a similar consonantal frame 
or, preferably, in isolation. After listening, the learner should be asked 
to identify the items presented in a number of randomly-ordered sets. 
(It is helpful if at this stage the learner is able to relate an English sound 
to a phonetic symbol or to a number, so that the success of his iden- 
tification can be checked without reference to orthography, which can 
often be ambiguous.) In the next stage, the learner listens to examples 
of the same vowels (which by now he can identify correctly), but this 
time vowels of his own language are interspersed with the English ones. 
He must identify what is English and what is not. Finally, if maximum 
phonetic precision is the aim, in a similar test vowel qualities which are 
neither those of English nor of his own language are included. It is 
important in this case that he should succeed in identifying as non- 
English those qualities which deviate both from English and from his 
own language. 


12.9.2 Production: Consonants 


Similar exercises can be used in introducing possible difficult features 
of the English consonantal system. For instance, in the case of the 
importance of aspiration in distinguishing initial /p, t, k/ from /b, d, 
g/, it will be appropriate to present the English series for recognition 
(after describing the aspiration feature), and then to insert examples of 
strong but unaspirated [p, t, k] sounds to be identified as non-English 
by the student. When it is a question of drilling such non-continuant 
consonants, it is of course necessary for them to be articulated with a 
vowel sound following. 

The emphasis placed on auditory discrimination drills is important in 
a more general sense. Particularly in the case of vowel articulations we 
monitor and adjust the quality of the sounds we produce very largely by 
means of the control provided by our own auditory feedback. The most 


! See my Practical Course of English Pronunciation: a Perceptual Approach (Edward 
Arnold, 1975) in which drills of this kind are presented for use with or without a teacher. 
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successful way of correcting a vowel sound is to indicate the auditory 
relationships between similar sounds. There is little in the way of articu- 
latory instruction that can be given, since the significant movements of 
the tongue, for instance, in the mouth are very slight. (An obvious 
exception to this generalization is the case of /3: /, for which it is useful 
to insist on the neutral (rather than rounded) position of the lips.) As 
for consonantal articulations, for most of which there is tactile sensa- 
tion of contact or stricture, it is possible to give helpful articulatory 
instructions, e.g. as regards the alveolar rather than dental nature of 
¡nasal etc. 


12.9.3 Production: Accentuation 


The perceptual approach used in teaching segmental phonemes can also 
be employed in the case of accentual features.! Learners (and especially 
those, such as tone language speakers, for whom the English concept 
of accentuation is quite new) must be taught to appreciate variation 
in the accentual patterns of English polysyllabic words, the varying 
rhythms of connected speech and the reduction of weak syllables in the 
utterance. 

In the case of polysyllabic words, correct pattern identification (pre- 
ferably by using such nonsense sequences as [l111, 1111] or [‘la: lala, lola: 
la] etc.) should precede drills involving differing patterns in English 
words, e.g. ‘behind’ v. “under”; or “yesterday” v. ‘im'portant’ y. 
maga zine’ v. “photo graph”, etc. (see also $9.7). 

Similarly, the learner should be presented with the different rhythmic 
patterns involved in such sequences as ‘The car’s come; the carpet’s 
come; the carpenter’s come’,1.€.[ . 6 e; e.o; . o NEE 
before attempting to produce the varying rhythms himself. In the same 
way, he should listen to passages of connected speech in order to 
appreciate the compression and obscuration of weak syllables, e.g., in 
‘Two of the children were waiting at the corner of the road for a bus’. It 
should be insisted that, in performance, it is the prominent syllables 
/'tu:—'tfil—'weit—'ko:—'raud—'bas/ which should be given most 
weight and that, 1f the intervening weak syllables are produced with a 
full form, the result will be loss of intelligibility for the English listener. 


12.9.4 Production: Intonation 


Learners may also by the same technique be led to appreciate the way in 
which pitch changes signal a shift of nucleus ina phrase where any word 
may carry primary accent.’ In a sentence such as ‘This is my book’, the 


! See my Practical Course of English Pronunciation, op. cit., Section 4. 
2 Ibid., Section 5. 
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nucleus may meaningfully be associated with any of the four words. 
Different versions should be presented to the learner until he is able to 
identify the nuclear placement with certainty. The more difficult 
exercises which must follow concern the recognition of the type of 
nuclear tone used (§10.5.1.2) and the appreciation of the pitch levels 
occurring before and after the nucleus (§§10.5.1.3—4). As in all other 
drills, correct recognition must precede attempts at production. 


Learners with different linguistic backgrounds will, of course, 
experience different difficulties in appreciating the distinctive elements 
of English. It is for this reason that a teacher should be aware of the 
phonetic and phonological characteristics of the mother tongue of his 
students (and of their particular local variety of this first language). By 
contrasting the feature of the two languages, he will be able to predict 
the problems which will arise and on which he should concentrate his 
drills; he will also be able to make use of phonetic resemblances between 
the two languages which may not be readily evident to the learner. 

Thus, a teacher of students whose language belongs to the Germanic 
or Slav families will not expect to encounter problems with the basic 
concept of word accentuation, which will almost certainly offer diff1- 
culty to speakers of, say, French or many Indian languages as well as to 
those who speak a tone language. On the other hand, as an example of 
the use of similarities, benefit may often be derived from relating 
English /1/ to a vowel of the cardinal [e] type, which exists in many 
languages not possessing a centralized vowel such as English /1/. Or 
again, many learners who cannot easily articulate the English ‘dark’ [t] 
may in their own language have a vowel of the cardinal [o] quality, 
which is an excellent starting point (or even substitute) for English [t] 
(and is indeed used by many native speakers for [t) —$8.7(3)). 


12.9.5 Reception 


The importance of the receptive side of language learning as distinct 
from the productive aspect has already been touched upon (§12.2.2). 
Although it is normally sufficient for the foreign learner to become 
proficient in his production of but one type of English pronunciation, 
adequacy in the reception (i.e. comprehension) of more than one style 
of RP is essential if communication with native speakers is to be 
efficient. This can be ensured not only by exposing students to broad- 
casts and recordings of natural speech (in a variety of styles including 
the most rapid and those which are highly elliptical and accompanied by 
‘noise’), but also by means of dictation exercises of a more analytical 
kind. The learner will often find it rewarding to transcribe phonetically 
various utterances of differing degrees of formality and casualness, 
noting the stylistic markers. 
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As students become more proficient, it will certainly be necessary, if 
for instance their first approach to the language has been through RP, 
for them to acquire facility in understanding a representative type of 
American English. Other accents should be taught sparingly. The 
choice will depend upon the ability and the needs of the learner: an 
attachment to American films and television programmes will require 
an acquaintance with a wide range of American accents; or a visit to 
London may suggest that the acquisition of some receptive ability in 
popular London speech might be useful. Practice in reception and com- 
prehension (whether it is by means of radio or television programmes or 
by specially recorded material) should however be supervised by a 
teacher who is able to judge how far the various styles of speech avail- 
able are appropriate to a foreign learner’s needs. It would obviously, 
for instance, be a mistake to accept unquestioningly as a model all that 
is broadcast on the home services of BBC radio. Apart from the speech 
of the newsreaders, who generally do not depart markedly from an RP 
norm, a great variety of accents, representative of different regions and 
cultures and of differing degrees of acceptability as far as the foreign 
learner 1s concerned, may be heard on the BBC: broadcast plays and 
comedy programmes are Often given in regional accents, with a good 
proportion of slang and near-taboo vocabulary; commentaries on 
soccer matches are unlikely to be given in more than a modified form of 
RP, whereas cricket commentaries (less attractive or accessible to the 
foreigner) tend to have a higher percentage of RP voices. It is advisable, 
therefore, for the foreign learner, faced with such complexity of 
accents, to restrict his proficiency to the level of reception, not allowing 
his production model to be modified incongruously and unacceptably 
by indiscriminate exposure to non-standard pronunciation. It is often 
the more able student, especially sensitive to variations of accent, who 
may tend to be influenced by and to imitate whatever he hears, unless he 
receives guidance. 


12.9.6 Assessment 


There remains the problem of the assessment of a learner’s perfor- 
mance, from the point of view of both reception and production. 

(a) A learner’s achievement in comprehension can obviously be 
tested and quantified by measuring the amount of information which 
he has derived from a passage of colloquial speech, e.g. by scoring the 
number of information points which have been correctly received 
(excluding those items such as proper names, etc., which may have been 
introduced to the listener for the first time). A score of this kind can be 
obtained either by questions on the text or by requiring the student to 
write down what he has heard. The test, when conducted with learners 
of high achievement, should involve passages in different pronuncia- 
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tion styles. The recognition of information points will, of course, be 
mainly concerned with the accented parts of an utterance and will be 
made easier by factors of redundancy and predictability. A test which 
involved the correct identification of intervening weak, grammatical 
items and inflexions would obviously be a measure of total compre- 
hension; although this type of detailed decoding goes beyond that 
which a native speaker normally employs when listening to his own 
language, it is nevertheless a good analytic exercise for the learner. 

(b) In the case of production, an assessment of efficiency is more 
difficult. An atomistic approach can be used, whereby the control of 
phonemic oppositions is tested through the reading aloud of word lists 
and short sentences containing crucial minimal pairs. Similarly, lists of 
words exemplifying a variety of accentual patterns will test this area of 
the learner’s proficiency. Various types of sentence (the grammatical or 
attitudinal context being given) can also be used to assess appropriate 
sentence accentuation and choice of nucleus. If, however, read texts are 
used (even if, as lengthy passages or dialogues, they are specially 
contrived to exemplify the maximum number of features of segmental 
phonemes and connected speech), the artificiality of the procedure 
should be recognized and allowed for, since a certain unnaturalness of 
style is likely whether it is a native speaker or a foreign learner who is 
reading aloud. 

At a higher level of achievement, phonetic quality must also be 
measured, the higher the level the more precise the phonetic specifica- 
tion in terms of the target model. Here the teacher’s role is vital, since it 
is he who must judge (usually by ear) the extent to which the learner 
approaches the model. He will pay particular attention to the features 
of connected speech mentioned in §12.7. 

The danger of an atomistic method of assessment meticulously 
applied is that departures from a norm will usually be found to be 
numerous, as much for the successful learner as for the one of lower 
ability. A simple aggregate of noted errors, undifferentiated in respect 
of their seriousness as far as communication is concerned, does not 
always provide a reliable indication of good or bad performance. A real 
assessment must be based on the intelligibility and acceptability of a 
learner’s performance, in a situation of free discourse with a native 
speaker, when many of the so-called ‘errors’, not being perceived by the 
native listener, may be regarded as trivial and ignored. 
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THE INTERNATIONAL 


Dental, 
Bilabial  Labiodental | Alveolar, or  Retroflex 
Post-alveolar 


Nasal 
Plosive 


(Median) 


Fricative 


(Median) 
Approzimant 


Lateral 
Fricative 


Lateral 
(Approzimant) 


Á 
(pulmonic air-stream mechanism) 


Trill 


S 


Tap or Flap 


N 


Ejective 


Implosive 


O 


(non- pulmonic 
-stream) 


(Median) Click 


aur 


C 


Lateral Click 





DIACRITICS OTHER SYMBOLS 


ə Voiceless y q . or . Raised e, e, e w œ, 2 Alveolo-palatal fricatives 
+ Voiced 8 t "or, Lowereder,e,e ¢ J, 3 Palatalized f, 3 
h Aspirated t! , Advanced u+, u T Alveolar fricative trill 
.. Breathy-voiced b a - or - Retracted i, i-, t I Alveolar lateral flap 
, Dental t ” Centralized ë f) Simultaneous f and x 
_ Labialized t ~ Nasalized á Js Variety of f resembling s, 
y Palatalized t, 1 1 E r-coloured a etc. 
- Velarized or Pharyn- : Long a: = 1 
. gealized +, 1 * Half-long a: == 
, syllabic n | ~ Non-syllabic ú 3 = Variety of ə 
“or _ Simultaneous sf (but see > More rounded o> x = r-coloured a 
also under the heading « Less rounded y: 
Affricates) 
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PHONETIC ALPHABET (Revised to 1979) 


Palato- 
alveolar Palatal: Velar  Uvular | Labral- Labial- 
Palatal Velar 





VOWELS Front Back STRESS, TONE (PITCH) 
Close ' stress, placed at begin- 
ning of stressed syllable: 
, secondary stress: ` high 
Half-close level pitch, high tone: 
_ low level: ° high rising: 
, low rising : ` high falling : 
Half-open low falling: ^ rise-fall : 
“ fall-rise. 
AFFRICATES can be 
Open written as digraphs, as 





ligatures, or with slur 
marks; thus ts, tf, dz: 
ttf &: ts tj dz. 

c, y may occasionally be 
used for tf, d3. 





Unrounded 
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Accent, and vowel quality, 74, 83, 
146-9; in word, 225, 223-39; in 
connected speech, 260-79; 
variation of accentual pattern, 71, 
235-7, 291-2; influence of word 
endings, 240-1; and intonation, 
269-79 

Accentuation, teaching, Chap. 12; 
priority, 318; in high acceptability 
RP, 326; teaching method, 336 

Acoustic, aspect of speech, Chap. 3; 
spectrum, 21; features of RP 
vowels, 100; plosives, 155-7; 
affricates, 175; fricatives, 181; 
nasals, 194-6; laterals, 201-2; /r/, 
208-9; /3/, /w/, 212-13 

Advanced RP, 88 

Affricate, 32-3; phonemic status, 
acoustic features, 172-5; detail, 
175-9; affrication of plosives, 160, 
164 

Air-stream, 9 et seq.; ingressive, 
egressive, 31, 34-7 

Allophones, 45; allophonic variants, 
49-50, 293-5; in complementary 
distribution, 49; allophonic 
(narrow) notation, 56 

Alphabet, OE, 76 

Alveolar (teeth) ridge, 15; 
articulations, 15-16, 32, 151; 


plosive, 163-5; fricative, 186-9; 
nasal, 197-9; lateral, 201-6; tap, 
roll, 33, 209; instability, 239, 301— 
2 

Amplitude, 12, 25, 224 

Anglo-Norman, AN, 68n. 

Apex, of tongue, 16 

Approximant, 33-4, 150, 206-12 

Aspiration, in plosives, 151-3; voice 
onset time, 152-3 

Assessment, methods, 338-9 

Assimilation, 292-7; allophonic, 
293-5; of place of articulation, 
293, 297-300; of voice, 294, 296; 
of lip position, 294-5; phonemic, 
295-300; fortis-lenis, 296; 
labialization, 293-4, 296-7; in high 
acceptability RP, 325 

Attitudinal function, of intonation, 
279-88; importance in teaching, 
2 MSI O 

Auditory, aspect of speech, Chap. 3; 
judgments, 37; skills, 334; 
feedback, 24, 336 

[æ], OE letter, 76; /22/-/a:/, 51, 87, 
107-9; [zr] for [er], 70; /æ/, length 
of, 93, 96, 107; detail, 107-9 

[a], Cardinal, 39; [a:] > [e1], 69-70, 
78,82, 130 

/ai/, detail, 131-2; /aro/, 139-41 
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[a], Cardinal, 39; /a:/, detail, 111-15; 
/a:/ and /e/, 51-2, 87, 112-13; OE 
[az] > [au], 75, 82, 135-6 

/av/, detail, 136-8; /aua/, 139-41 


Back, of tongue, 16-17 

Back vowels, 43 

Bell, Alexander M., 67 

Bilabial articulations, 31; plosives, 
161-2; nasals, 196-7 

Blade, of tongue, 16 

Body, in intonation, 275; step up in, 
275 

Breathy voice, 12 

Broad, notation, 56—7 

/b/, detail, 161-2 

[6], 36 

[B], 15; for /b/, 159-60; for /w/, 183, 
218 


Cardinal vowel, scale, diagram, 39- 
41, 61; values in minimum general 
intelligibility model, 329-30 

Central vowels, 43 

Centre, of tongue, 16 

Centring diphthongs, 142-6 

Chart, chief English consonantal 
articulations, 35 

Chronemes, 98 

Citation forms, 326 

Clear [1], 49, 51, 202-5 

Cleft palate speech, 14, 197, 198, 200 

Clicks, 37 

Close, vowels, 43 

Closing, stage of plosives, 151 

Coalescence, of phonemes, 83, 177, 
191, 299-300 

Co-articulatory effects, 293-5 

Communication, Chap. 1 

Commutation, for consonants, 46; 
for vowels, 92-3 

Comparative adjective formation, 
257 

Complementary distribution, 49 

Compound words, accentual 
patterns, 331-3 

Compression, stage of plosives, 151 

Connected speech, Chap. 10; accent 
in, 260-79; word in, Chap. 11; 


priority, 320; in high acceptability 
RP, 325; in minimum general 
intelligibility, 333 

Conservative RP, 88 

Consonants, acoustic features, 2324; 
and vowels, 29-31, 54-5; types of, 
31-7; classification and chart, 34, 
35; in RP, 47, Chap. 8; chart, 35, 
150; plosives, 151-72; affricates, 
172-9; fricatives, 179-94; nasals, 
194-201; laterals, 201—6; /r/, 204— 
13; semi-vowels, 212-18; examples 
of change, 69-71, 83; loss of, 83; 
frequency of occurrence, 218-19; 
elision of, 239, 301-2; priority in 
teaching, 318; in high acceptability 
RP, 323-5; in minimum general 
intelligibility, 331-2; teaching 
method, 335—6; assessment, 338-9 

Content words, 261,264 

Continuant frictionless consonants, 
34, 194-218 

Continuous nature of utterance, 19, 
44, 289 

Contoid, 30, 55 

Cooper, Christopher, 65 

Creaky voice, 12, 168 

ic], 167; >[t J], 71, 77, 1741598 

[ç], 71, 193; loss of, 83; for /j/, 213- 
14 


Dark [t], 49, 51, 202-6; = [o], 49, 
206 ` 

Dental articulations, 31; plosives, 
164; fricatives, 184-6; nasals, 197— 
8 

Dependant, sound change, 70 

Description, of articulations, 18, 29; 
and classification of speech 
sounds, Chap. 4 

Development, theoretical paths of, 
75 

Devoicing, of consonants: plosives, 
153; fricatives, 180; /r/, 153, 207, 
208, 211; /1, 153; 203, 205, 20004 
153, 213-14; /w/, 153, 215-18 

Diachronic studies, 7, 66 

Dialectology, 7 

Dictionaries, early, 66 


Diphthongization, in sound changes, 
83 

Diphthongs, 41-2; RP, 92-100, 127— 
46; + [a], 139-41; falling, 128; 
centring, 142-6; rising, 94-5, 143, 
145; closing, 300 

Discriminatory skills, 334—5 

Distinctive features, 48n. 

Distinctive word accentual patterns, 
235-7 

Distribution, of /r/, 87; of /x/, 93 

Double articulation, 32, 216n. 

Duration, 26, 57; of fricative noise, 
181; of plosive closure, 156; of 
semi-vowels, 212-13 

/d/, detail, 163-5; instability of 
medial, 239; of final, 298-9, 301-2 

/d3/, phonemic status, 1724; detail, 
175-7; <[d+j], 176-7, 214, 299- 
300; <1 5), 71, 174, 177: for /3/, 191 

/dr/, phonemic status, 172—5; detail, 
177-9 

[ð], OE allophone of /0/, 71; OE 
letter, 76; /9/, detail, 1846 

[d ],36 


Early grammarians, evidence for 
reconstruction, 61—7, 79 

Early Modern English, eModE, 
68n.; reconstruction of, 78-9; 
sound system and text, 81-2 

Ejective consonants, 36 

Electromyography, 9 

Electropalatography, 18 

Elision, in word, 74, 237-9; in 
connected speech, 300-2; in high 
acceptability RP, 325 

Ellis, Alexander J., 67-8 

Elocution, 67 

Emphasis, function of intonation, 
270-1 

Epenthetic plosive, 182n., 184n., 
188, 189n. 

Evidence for reconstruction, 74-9 

Explosion, stage of plosives, 151-2 

External sound change, 70 

[e], Cardinal, 39; [e:] > [i:}, 72, 78, 
82, 102-3; /e/, detail, 106-9 

/e1/, detail, 128-30; /e1a/, 139—41 
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[€], Cardinal, 39, [e:] > [11], 72, 82, 
102 


/ea/, detail, 144—5 

[ə], spelling in OE, 76—7; result of 
weak accent, 74, 83; glides to, 93; 
/a/, detail, 125-7; diphthongs +, 
139-41; /a/-/3:/, 96, 1234; /a/, 
associated with weak accent, 94, 
125-7, 146-7, 226-7, 266 

[ai]; < [i]; 72,82, 132 

[au], 51, 72, 75, 82; [au] >/au/, 138; 
/3u/, detail, 134-6; /aua/, 139-41 

/13:/, 70; detail, 122—5; /3:/-/a/, 96, 
122-5; /3:/-/au/, 1345; /a/-/av/, 134 


Falling diphthongs, 128 

Falling nucleus, 277; rising 
reinforcement, 274; and attitude, 
281-2 

Falling-rising nucleus, 273; and 
attitude, 284 

Fashion, and change, 70 

Feedback, auditory, 27-8 

Flap articulation, 33, 209 

Focus, intonational, 271, 279-80 

Fonetik Jurnal, 67 

Foreign borrowings, and system, 70 

Formants, 21-3 

Form words, 261-2, 266—9 

Fortis articulation, 34, 150; plosives, 
152; fricatives, 179; in 
assimilation, 294, 196; as 
phonological category, 194 

Free variants, 49 

French words, distinctive accentual 
patterns, 235-7 

Frequency of occurrence, vowels, 
149; consonants, 218-19; words, 
306 

Frequency of vibration, 
fundamental, harmonics, 12, 20-5, 
a7 

Fricative articulations, 33; in RP, 
179-81; detail, 181-94; acoustic 
features, 181; for plosives, 160 

Frictionless continuants, 33; in RP, 
150, 194-212 

Front, of tongue, 16 

Front vowel, 43 
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/f/, in OE with allophone [v], 80, 83; 
detail, 181-3; <[x], 183 

Functional load of [3], 191 

Fundamental frequency, 20-5; 
laryngograph, 25 


Gemination, 158 

General RP, 88 

Gil, Alexander, 63 

Gliding vowels, 93-5, 127-8 

Glossic, 67 

Glottal articulations, stop [2], 11, 32, 
151; detail, 168-72; reinforcement 
of stops, 158-9; glottalized 
consonants, 158; fricative, /h/, 
192-4 

Glottalic airstream mechanism, 36, 
159 

Glottis, 11 

Grammatical words, 261-2, 266-9 

Graphology, 6 

Great Vowel Shift, 70, 74; summary, 
82-3 

/g/, detail, 166-8 

[g], 36 

[3], > [d3], 71, 174, 176-7; 167; for 
A 215 


Half-close, -open, vowels, 43 

Harmonics, 20 

Hart, John, 62-3, 84-5 

Head, intonation, 275 

Hearing mechanism, 26-7 

High acceptability RP, 316, 320 

Historical background, Chap. 6 

Hold, stage of plosives, 151 

Homorganic articulations, 159, 171 

Hypernasality, 14 

/h/, detail, 192—4; restricted 
occurrence, 47; and /n/, 50; 
insertion, 71, 83; OE/hn, hl, hr, 
hw/, 80, 193; loss of, in clusters, 
83; in form words, 266-7; in 
analysis of long vowels and glides 
to [a], 99 


i-mutation, [u:]>[i:], 70 
Idiolect, 52 
Implosive articulations, 36 


Incomplete plosives, 157 

Independent sound change, 69 

Ingressive air-stream, 36-7 

Instability of final alveolar 
articulations, 298-9, 301; of 
medials, 239, 301 

Intelligibility, of spoken English in 
the world, 90-1; minimum 
general, 316, 328-9; restricted, 
317 | 

Intensity, 25, 57-8, 181, 224 

Internal combinative sound change, 
70; isolative change, 69-70 

International Phonetic Alphabet, 
chart, 340-1 

Intonation, Joshua Steele, 66; 
accentual and non-accentual 
functions, 269-70; accentual 
function, 270-1; realization, 
primary accent, 270; types of 
nucleus, tail, frequency of 
occurrence, 272-9; realization, 
secondary accent, 275-7; head, 
body, 275; realization, unaccented 
syllables, 277-80; attitudinal and 
grammatical functions, 283-8; 
multi-nuclear patterns, 285-6; 
priority in teaching, 320; in high 
acceptability RP, 327-8; in 
minimum general intelligibility, 
333; teaching method, 337 

Intrusive /r/, 99, 303-4 

Isochrony, 263-4 

[1], [1], Cardinal, 39-40; [i:] > [ar], 
70, 72, 82, 131; /1:/, gliding, 93; 
/i:/-/1/, 96-7, /i:/, detail, 101-2 

/i/, detail, 103-6; and weak accent, 
104, 147, 227, 266 

hra/, detail, 142-4 


Johnson, Samuel, 66 

Jones, Dantel, Cardinal vowel 
system, 39-41, 68, 98, 142; 228 

Juncture, 304-6 

/j/, phonemic status, 94-5, 98-9, 
212-13; detail, 213-15; = [ç], 213- 
14; coalescence with alveolars, 176 


[k`], 36 


/k/, allophones, 49; detail, 166-8 


Labialization, 32, 217; of /r/, 209; in 
assimilation, 295, 297 

Labial-velar semi-vowel, 215-18 

Labio-dental articulations, 31; of /p/, 
48, 162; fricatives, 181-3; of 
nasals, 51, 196, 197; frictionless 
continuant for /r/, 209 

Language, 4-5 

Larynx, laryngoscope, 10-13; 
laryngograph, 25 

Lateral articulations, 33; in RP, with 
acoustic features, 201-2; /V/, detail, 
202-6; release of plosives, 159-60 

Length, 19, 26, 57; of vowels, 96-7; 
of diphthongs, 128; before 
plosives, 153-5; before fricatives, 
180; in word, 227; in connected 
speech, 265-9 

Lenis articulation, 34, 150; plosives, 
152; fricatives, 180; in 
assimilation, 294, 296; as 
phonological category, 194 

Lexical words, 261 

Lexicon, Lexis, 6 

Liaison, in connected speech, 302-4 

Lingual roll, /r/, 33, 209 

Linguistic approach, to syllable, 54 

Linguistic hierarchy, 46 

Linguistic units, 44-5 

Linking /r/, 114, 118, 124, 127, 141, 
144, 145, 146, 208, 258, 3024; in 
teaching, 324 

Lips, 15; positions, 15-16; in 
assimilation, 294, 297 

Lisping, 180 

Locus, 157 

Loudness, 19, 25, 58, 224, 228, 264 

Lungs, 9 

I Lt]; 49; 51; (11, 55, 201-3; [1], 
201-6, realized as vowel, 49, 204; 
[1], 153, 203, 206; OE /hl/, 80, 205; 
/V, detail, 202-6 

14], 215 


Manner of articulation, 32-3, 156-7 
Middle English, ME, 68n.; 
reconstruction of, 77-8; sound 
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system and text, 80-1 

Minimal pairs, 47 

Minimum general intelligibility, 316, 
328-9 

Minus transition, 156 

Models of pronunciation, 313; the 
native speaker, 313-14; the 
foreign learner, 314; choice of 
basic model, 314-15; performance 
targets, 316-17; high acceptability 
RP, 316, 320; minimum general 
intelligibility, 316, 328-9; 
restricted intelligibility, 317; 
assessment, 338-9 

Modified, regional pronunciation, 88 

Morpheme, 46; variations at 
boundaries, 292-3, 306 

Morphemics, morphophonemics, 6- 
a 

Mouth cavity, 14-18 

Multi-nuclear intonation patterns, 
285-6 

/m/, detail, 196-7 

[m], neutralization of /m/ — /n/, 50-1, 
197, 198, 294 


Narrow, notation, 56 

Nasal articulations, 13-14, 33; 
neutralization, 50-1; in RP, 194-6; 
acoustic features, 195; detail RP 
nasals, 196-201; nasal release of 
plosives, 159 

Nasal cavity, 14 

Nasalization, 14; in assimilation, 
295, 296-7 

Nasalized vowels, 41, 295 

Neutralization, of phonemes, 50-1, 
197, 198; of weak forms, 290-1; 
through assimilation, 300 

Newton, Isaac, 63 

Noise component, in consonants, 20, 
234, 30, 150, 155-6, 181 

Non-rhotic accents, 87, 208 

Notation, 56-7, 67 

Nuclear accent, in word, 225; and 
intonation, 270 

Nuclear tones, intonation, types of, 
272-7; multi-nuclear patterns, 
285-6; in teaching, 336-7 
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/n/, detail, 197-9; [n], 55, 198; [in] 
for [19], 70, 200-1; OE /hn/, 80, 
198; instability of final, 298 

/n/, phonemic status, 50, 51, 72, 83; 
detail, 199-201; restricted 
occurrence, 47; stems ending in, 
257-8 

/n/, 201,215 


Obstruents, 34, 150, 151 

Off-glide, of plosives, 152 

Old English, OE, 68n. ; dialects, 
alphabet, reconstruction of, 75-7; 
sound system and text, 80 

Old French, OF, 68n. 

On-glide, of plosives, 151 

One tap [rc], 33, 209, 211 

Open vowels, 43 

Oral mainstream mechanism, 36-7 

Oral escape, 13 

Organs of speech, diagram, 11 

[o], Cardinal, 39; for [+], 49 

[9], Cardinal, 40; /9:/-/99/, 73, 934, 
117-18; /o:/, detail, 116-18 

/31/, detail, 1324; /o1a/, 139-41 

[o], Cardinal, 39; /p/-/9:/, 52, 69, 88, 
115;/p/, detail, 114-16 

[ce], Cardinal, 39 

[cs], Cardinal, 39 

[6], Cardinal, 39; in OE, 83; 
OZ 


Palaeotype, 67 

Palatal articulations, 32; semi-vowel, 
213-15 

Palatalization, 16 

Palate, hard and soft, 13-15 

Palato-alveolar articulations, 32; 
affricates, 175-7 

Palatography, 17-18 

Palsgrave, John, 61 

Paralinguistic features, 58, 286-7 

Past tense formation, 256 

Paths of development, 75 

Pharynx, 13; constriction of in /2/, 
107 

Pharyngeal airstream mechanism, 36 

Phonation, see Voice 

Phonation types, 12, 58 


Phoneme, 5, 45, 46-8, 221; RP 
consonantal, 47; diversity of 
solutions, 47-8; relevant features, 
48; neutralization of, 50-1; 
conditions of analysis, 51—2; the 
word as a basis for analysis, 52; 
notation, 56-7; changes in 
distribution, 82; RP vowels, 924; 
potentiality of oppositions, 293; 
coalescence; 293; in assimilation, 
295 

Phonetic description, 29 

Phoneticians, early, 61-3 

Phonetics and linguistics, 6-7 

Phonetic studies in England, 61-8 

Phonetic symbols, list, viti-x 

Phonographic Society, 67 

Phonology, 6 

Phonotactics, 6 

Phonotactic possibilities, 241 et seq. 

Phonotype, 67 

Pitch, 12, 19, 25, 57; and prominence 
in word, 224-5, 227; shift of pitch 
prominence, 235; in connected 
speech, 269-82 

Pitman, Isaac, 67 

Place of articulation, 31-2, 152, 179— 
80; variation, 293-4, 297 

Plosive articulations, 32; 
neutralization after /s/, 50; RP 
plosives, stages, significant 
features, 152-5; acoustic features, 
155-7; release stage, 157-61: 
detail, 161-72 

Plosive epenthesis, 188 

Plural formation, 257 

Post-alveolar articulations, 32; 
affricates, 177-9; frictionless 
continuant /r/, 206-12 

Post-nuclear secondary accent, 276- 
f; 

Pragmatics, 7, 286-7 

Pre-fortis clipping, 96, 1534, 176, 
180 

Pre-nuclear secondary accent, 2756 

Present English, PresE, 68n. 

Present participle formation, 257 

Primary accent, in word, 147, 225 er 
seg.; and intonation, 265 et seq. 


Primary articulation, 32 

Production of speech, Chap. 2; 
productive competence, 314; 
production, vowels, 335; 
consonants, 3356; accentuation, 
336; intonation, 336-7 

Prominence, 25; and syllable, 53; in 
word, 223 et seq.; shift in word, 
235; in connected speech, 255 et 
seq.; relative prominence of 
sounds, 226 

Prosodic features, 57-8 

Psycholinguistics, 7 

Pulmonic, source of energy, 9 

Pulse theory, and syllable, 53-4 

Puns, 79 

Pure vowels, 41 

Puttenham, George, 85 

/p/, detail, 161-2 

[p’], 36 

[$] for /p/, 160; 179 


Quality, sound, 204; of weakly 
accented vowels, 74, 76-7, 83, 
125-6, 146-9; and prominence in 
word, 225-7; variation in weak 
forms, 265-9 

Quantity, 26-7, 57; changes in, 71, 
83; in OE, 77; of vowels, 96-7; 
and prominence in word, 227-8; 
and rhythm, 265 


r-coloured vowels, 209 

/r/ links, 3024; in suffix formation, 
258 

Range, of intonation, 287 

Rate, of delivery, 26, 286 

Realization, 45, 49 

Received Pronunciation, RP, 84-8; a 
wider-based, 315-16; high 
acceptability, 316, 420 

Receptive skills, 314, 337 

Reconstruction, evidence for, 74-9 

Redundancy, 5-6, 290, 293, 301-2; 
importance in comprehension, 
317, 337-8 

Regional variants, 84, 86-8 

Release stage, of plosives, 151-2, 
157-61 


Index 361 


Relevant features, of phonemes, 48 

Resonators, 13, 21 

Restricted occurrence, of phonemes, 
47 

Retroflex articulations, 32; {q}, 209, 
331 

Retroflexion, 16 

Rhetoric, 67 

Rhotic accent, 87 

Rhymes, as evidence for 
reconstruction, 77, 79 

Rhythmic accent, in word, 148, 225; 
patterns, in word, 228-33; change 
of word pattern, 233-4; distinctive 
patterns, 235—7; in connected 
speech, 263-5; rhythmic group, 
263 

Rims, of tongue, 16 

Rising diphthongs, 94-5, 143, 145 

Rising nucleus, 272-3; and attitude, 
2834 

Roll articulations, 33, 209 

Rounded vowels, cardinal, 40 

Runic writing, 76, 33, 207 

[1], 32, 207-11 

[r], (trill), variant for /r/, 209; /r/, 
detail, acoustic features, 206-13; 
loss of /r/, 83, 112-13, 117, 124, 
143, 144, 146, 210; OE /hr/, 80, 
210; restricted occurrence, 47 

[r], [5], uvular articulations, 32, 209 


Salesbury, William, 62 

Secondary accent, in word, 127, 147, 
225 et seq.; in intonation, 270, 
275-7 

Secondary articulation, 32 

Segment, sound, 44-5; teaching, 
318-19, 321-5, 334-5 

Semantics, 7 

Semi-vowels, 30, 334; in RP, 213- 
18; acoustic features, 212-13 

Shaw, Bernard, 62 

Sheridan, Thomas, 66 

Short words, in RP, 92-3 

Sibilant, 257 

Simple, words, accentual patterns, 
228-30 

Simplified Spelling Society, 62 
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Smith, Thomas, 62 

Smoothing, 94, 139-41, 193, 300 

Social variants, 84, 85-6 

Sociolinguistics, 7 

Sonorant, 34, 150, 194 

Sound changes, 68-9; types of, 69- 
71; and linguistic system, 72-4 

Sounds in language, Chap. 5 

Source of energy, in speech, 9 

Special context forms, 289 

Spectrogram, 214 

Spectrum, acoustic, 21-4 

Speech, 1; physiological aspect, 
Chap. 2; speech chain, 8; speech 
mechanism, 9-18; as a continuum, 
19, 44, 221, 289; sounds and 
linguistic units, 44-5; sounds, 
acoustic and auditory aspects, 
Chap. 3; synthesis, 27 

Spelling, as evidence for 
reconstruction, 76-9; 
standardization of, 66, 84 

Spelling reformers, 62, 67 

Standards, of pronunciation, 8324; 
emergence of, 84-5; regional, 
social variants, 85-8; teaching 
models, Chap. 12 

Steele, Joshua, 66 

Step up, in intonation body, 275 

Stress, 25, 57; early changes, 71, 78; 
in word, 223-37; shift in word, 
235-7; influence of word endings, 
240-1; shift in connected speech, 
291-2; and rhythm, 263-5; and 
vowel quality, 74, 83, 125, 146-9, 
225-7 

Stress shift, 229, 291, 327 

Styles, of connected speech, 307-11 

Sulcalization, 17 

Suffix-formation rules, 256 

Superlative adjective formation, 257 

Supra-segmental features, 57 

Syllabic sounds, 55 (see also /m/, /n/, 
M) 

Syllable, 53—4; prominence, pulse 
theory, 534; linguistic approach, 
54; unaccented, in intonation, 
277-80 

Syllable boundaries, 24] 


Synchronic studies, 7, 74 

Syntax, 6 

Synthesis, speech, 27 

System, changes in linguistic, 724, 
82-3 

Sweet, Henry, 68 

/s/, with OE allophone [z], 71; /s/, 
detail, 186-9; /ns-nts/, 188; 
instability of final, 298—9 

[s`], 36 

/$/, detail, 189-91; < [s +j], 191, 214, 
298 


Tail, in intonation, 272-3 

Teaching, methods, 334 et seq. 

Teeth, 14; ridge, 15 

Tip, of tongue, 16 

Tolerances, in high acceptability RP, 
vowels, 321-3; consonants, 324-5 

Tone, 20 

Tongue, parts of, 16-17; hollowing, 
grooving, 17 

Tonic accent, in word, 147, 225 

Trachea, 10 

Transcription, broad and narrow, 
56-7, 67 

Transition, 24, 151, 157, 181, 195, 
202, 208-9, 213, 292-3 

Trill, 33, 209 

Types of sound change. 69-71 

/t/, phonetic features, 45; detail, 
163-5; instability of medial, 239, 
301-2; of final, 298-9, 301 

[1], [ts’], [tr], 36 

/t$/, phonemic status, 172-5; detail, 
175-7; /nf-ntf/, 177; < [t +j], 177, 
214, 299-300 < [c]. 71, 77, 173, 177 

/tr/, phonemic status, 172-5; detail, 
177-9 

/0/, detail, 184-6; OE /0/ with 
allophone [ð], 71; </t/, 165, 185 


Unaccented, vowels in words, 74, 
83, 146-9, 226; words in connected 
speech, 265-9; syllables and 
intonation, 271, 277-9: 
importance in teaching, 318, 326- 
7, 336 

Unrounded vowels, cardinal, 39-40 


Uvula, 14 

Uvular articulations, 17, 32, 33, 209 

[u], Cardinal, 39; /u:/, gliding, 93; 
/u:/, detail, 119-22; [u:] >[au], 69 
78, 82; /u:/-/u/, 87, 96, 119, 121 

fü], Cardinal, 40, 121 

/u/, detail, 119-20; >/a/, 111; and 
weak accent, 147, 226, 266; [u1], 
94 

/Ua/, detail, 145-6 

[ur], Cardinal, 39; in /au/, 135; in 
lau/, 138 

[y], Cardinal, 39-40; >/a/, 111; for 
/u/, 119; in/au/, 135; in /au/, 138; 
for /Y/, 204 

[y], for /g/, 160, 168, 179 


2 


Variants, free, 49 

Velar articulations, 16, 32; plosives, 
166-8; nasals, 199-201 

Velaric airstream mechanism, 36 

Velarization, 32; of [ł], 204; of /w/, 
216 

Velum, 15 

Vibrator, 11-12, 20 

Vocal folds, 10-12; frequency of 
vibration, 12, 20-1; modes of 
vibration, 12 

Vocoid, 30 

Voice, 11; in RP plosives, 153; 
fricatives, 180; nasals, 195, 196, 
197, 198; /1/, 153, 2024; /r/, 153, 
207-8, 211; //, 153, 213-15; /w/, 
153, 215-19; in assimilation, 294, 
296; voice onset time, 153; voice 
qualities, 12, 58 

Vowels, acoustic structure, 23; in 
RP, 99; and consonants, 29-31, 
54-5; type, 30, 37-8; description, 
38—43; classification, 42-3; 
relatively pure and gliding, 41-2; 
vowel harmony, 70; change, rate, 
and route, 71-2; examples of 
change, 82-3; RP phonemes, 
Chap. 7; distinctive units, 93-4, 
98-9; length, 96-7; phonemic 
interpretation, 98-9; detail, 
relatively pure, 101-27; 
diphthongal, 127-46; 


Index 363 


diphthongs + [a], 139-41; centring 
diphthongs, 142-6; under weak 
accent, 74, 83, 125-7, 146-9, 226; 
frequency of occurrence, 149; 
elision of, 238-9; strong, weak, 
226-7; priority in teaching, 318; in 
high acceptability RP, 3213; in 
minimum general intelligibility, 
329-30; teaching method, 335; 
assessment, 338-9 

/v/, detail, 182-3; [v] allophone of 
OE /f/, 71; for /w/, 218 

lu], for /r/, 209; for /w/, 218 

[A], Cardinal, 39; /a/, 75, 82, 87; 
detail, 109-11 


Walker, John, 66 

Wallis, John, 634, 167 

Weak form, 260-2, 266, 265-9; 
neutralization of, 290-1; 
importance in teaching, 318, 326, 
336 

Wilkins, John, 64-5 

Word, 46, Chap. 9; as a basis for 
phonemic analysis, 52; and 
connected speech, Part IH; and 
accent, 223-4; and prominence, 
224-8; accentual patterns, 228-33; 
pattern instability, 2334; 
distinctive patterns, 235-7; elision 
in, 237-9; phonotactic 
possibilities, 241-2; initial and 
final sequences, Tables, 243-52; 
variability of phonemic structure, 
258-9; quality and quantity in 
connected speech, 265-9; in 
connected speech, Chap. 11; 
variation of accentual patterns, 
291-2; variations of boundaries in 
assimilation, 291-2; junctural 
features, 304-6; frequency of 
occurrence, 306 

Writing, 3 

/w/, phonemic status, 94-5, 98-9, 
212-13; detail, 215-18; /w/-/m/, 51, 
215-18; [wa] >[wb, wor], 70, 79, 
115-16, 118; OE /hw/, 80, 218; 
[wr] >/r/, 210 

[m], 51-2, 215-18 
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X-ray, photography, 9, 13, 17,38,44  /z/, detail, 186-9; [z]allophone of 
[x], 71, 179; loss of, 83; allophone of OE /s/, 71; for /d/, 160 
OE /h/, 80; >[f], 183; for /k/, 160 /3/, 70; detail, 189-91; restricted 
[x°], 36 occurrence, 47; <(z+j], 191, 214; 
replaced by /d3/, 70, 191 
[y], Cardinal, 39; in OE, 70, 80; 
rencor > UN 122 
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